i 
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<210> 1 

<211> 733 

<212> UNA 

<213> Homo sapiens 

<40Q> 1 1 



SO 



130 
24Q 
300 
360 
420 
43Q 
240 
600 
660 
720 
733 



gggatccgga gcccaaatcr tctgacaaaa ctcacacaig cccaccgtgc coagcacctg 

aattcgaggg tgcaccgtca g-cs;cctc; tccccccaaa acccaaggac accc~catga 12 0 

tctcccggac tcccgaggtc acacgcgtigg tgg-ggacgc aagccacgaa gaccccgagg 

tcaag-tcaa ctggtacgcg gacggcg^gg aggcgcacaa Cgccaagaca aagccgcggg 

aggagcag^a caacagcacg taccgtgtigg tcagcgtcct caccgtcctg caccaggact 

ggctgaatgg caaggagtac aag-gcaagg tccccaacaa agccctccca acccccatcg 

agaaaaccat ctccaaagcc aaagggcagc cccgagaacc acaggtgtac accccgcccc 

catcccggga tgagctgacc aagaaccagg tcagccigac ctgcctgg-c aaaggccccc 

atccaagcga caccgccgtg g'agtgggaga gcaatgggca gccggagaac aacsacaaga 

ccacgcctcc cgtgctggac tccgacggct ccttcttcct ccacagcaag cccaccgtgg 

acaacagcag gtggcagcag gggaacgtct tctcatgctc cgtgatgcac gaggc-cugc 

acaaccacca cacgcagaag agcccctccc tgtctccggg taaacgagtg cgacggccgc 
gac-ctagag gat 

<210> 2 
<211> S 
<212> FRT 
<213> Homo sapiens 

<220> 

<221> Site 

<222> (3) 

<223> ICaa e<r/J.als any of Che cwenty naturally ccurring L-anu.no acxas 

<40Q> 2 

Trp Ser Xaa Tro Ser 
L 5 



2 



<2ia> 3 

<211> 35 

<212> DNA 

<213> Homo sapiens 

<4QG> 3 

gcgcctcgag acctccccga aacctagat" tccccgaaat: gac"ccccg aaacgacc^c SO 
c=cgaaacac ctgccacccc aaccag 8 6 



<21Q> 4 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<4QQ> 4 

gcggcaagct tcccgcaaag cc-aggc 27 



m 
s 



<210> S 

<21i> 271 

<212> DNA 

<213> Homo sapiens 

<4G0> 5 

ctcgagatcc ccc^gaaacc cagac-~ccc cgaaacgat" uccccgaaat gatitcccccg SO 

aaatatictigc catcicaacc agccagcaac caeagccccg cccccaaccc cgcccacccc 12 0 

gcccctaact ccgcccagtt ccgcccattc cccgccccac ggctgactaa ttc~ccccac 130 

ccatgcagag gccgaggccg cctcggcc~c Cgagctactc cagaagtagc gaggaggcct: 240 

tcttggaggc ciaggctctc gcaaaaagcc t 271 



m <21Q> 6 

U| <211> 32 

p| <212> DNA 

St <213> Homo sapiens 

<400> S 

gcgcccgagg gacgacagcg acagaacccc gg 



<210> 7 

<21i> 21 

<212> DNA 

<213> Homo sapiens 

<400> 7 

gcgaagcctc gcgacccccc ggatccgccc c 31 



<21Q> a 

<211> 12 

<212> DNA 

<213> Homo sapiens 

<400> 3 
ggggaccctc cc 



3 



<2ia> 9 

<21I> 73 

<2I2> DNA 

<213> Homa sapiens 

<400> 9 

gcggcctcga ggggaccccc ccggggactG cccggggact: cnccgggacr ctccarcc^g 60 
ccatctcaat tag 73 



<21Q> 10 

<211> 2SS 

<212> DNA 

<213> Heme sapiens 



<4QQ> 10 

ctcgagggga ctttcccggg gactccccgg ggactttccg ggactctcca cczgccatcc 

caactagcca gcaaccacag ccccgcccc- aacnccgccc a-cccgcccc naactccgcc 

cagttccgcc caccctccgc cccacggctg accaatccct cccaCCCaCg cagaggcega 

ggccgccccg gcctc^gagc Catcccagaa gtagtgagga ggccccctcg gaggectagg 
ctttiigcaaa aagctt 



SO 
120 
130 
240 
256 



<2ia> ii 

<211> 3239 

<212> DMA 

<213> Homo sapiens 



<4QQ> 11 

cccggg~cga 

eggaaac-gc 

caggtaacag 

cctggccctc 

gg-gactctg 

ccacc-ggac 

ggacctctgg 

tggccaagaa 

tcgcgtgcct 

gcaggcccat: 

gcgcaccc-c 

agecagggge 

aagegtc^gg 

tcgtggccgc 

acacgggccg 

cegtcgtcac 

gcattcttac 

cactggtgrc 

gcctgacc-c 

geggaacaca 

ctccctggtc 

gtaccacaag 

ggaggagacc 

agaggtgtac 

ctacatgtac 

ctaccacggg 

caccatccac 

eggcaaggeg 

cctggagaac 

cgcccacgac 



cccacgcgtc 
accccagacc 
gcccggcc-c 
agcacccgcc 
gccctcacga 
cgcagcctcc 
gcagcccgcc 
cagtgcgceg 
cctcgcgggc 
ccgggacccc 
cccgctccgg 
ggccaccgag 
ggccacgctg 

CtGGZCCHtG 

cgccactgat 
cgngcgcctg 
cctcataCCG 
ccaggagaca 
ggacaccacc 
gtcaaggtca 
accctcacgg 
aggc"cccca 
accagcgcca 
ccgcggaagc 
cacgcctggg 
ggccaccttg 
gagtttg-ccc 
acggccctgg 
tcctaccccc 
cacaagcacc 



eggegaggae 
cgagcccgcz 
gccccgiigga 
cacccccaac 
gtg-ggacac 
tggaggacat 
cgcaccgcag 
gggccccggc 
atetatgeca 
tggttctggg 
cggccgc-gt: 
goegaaggee 
cagaagccgc 
ctcatcgtigg 
ggcaccgtca 
c~ggccat:cg 
gecagactga 
agcttccttg 
adggccagcg 
cgggcgtggt: 
gctcccccat 
aagaggtcca 
tgaagactgt: 
tgcagcagg- 
geageggget 
ccatcccagg 
tgggagaccg 
tggggccccc 
Cggagggggg 
tgcaccgngt 



cgcgtccggc 
gcgcgccccc 
cgacgatggc 
cagcaggacg 
ccgcgtgacc 
ccgccacticc 
ctgcctgc-g 
ggccgcgggc 
Cggcgaagcc 
ccctg- tcg~ 
ccaccgtzgcg 
tccctgggag 
"ctcctiacac 
cagccctggg 
tccagaaaag 
gcagctcatt 
acatccgccc 
atgagaaccg 
acGtggUccc 
ggcccccacg 
catcacgacg 
gaacgcGCcg 
ccggagcttG 
gcacaagccg 
cacac-gctg 
ccagatgacc 
Cacggagaat: 
gggcagtggg 
ccgggtgccg 
gatctccccg 



gcagtcctca 
CGCcagagcc 
ctitgccGccg 
cggctgGgga 
acggccatcc 
aacatctccg 
ccggggagcc 
ctcgcggctg 
gctgc^cccc 
gtggacguac 
gccaggcacc 
cggccggcca 
caagcccgac 
agagaccctc 
ca^ggaccag 
cgccgcaggt 
icgaaact-gt 
cacaggggac 
cagaacacca 
etcagccccc 
gcg-GGaaca 
gecagagega 
gecaacgagg 
aacaggaagg 
gtggtccagg 
agcggcaacc 
gcctccctca 
aagagctcct 
ctggacggca 
gugagccagg 



atgagcagcg 
cacctzggngc 
Cgagctacaa 
aggcggtggt: 
acg~ct:tcag 
actcggtgcc 
accactggcg 
gtcatcaccc 
tcagagg-gc 
actccactcg 
caggccGtgg 
ccgcccgaac 
guggccctcc 
ccgccccacz 
etcagcaegg 
acccggggcg 
cccttccgct 
GZcaCctccc 
a:gccttccc 
catggcagcc 
tc^aeggcaa 
gcaacacggc 
aggaggaggc 
aggcagc~gc 
ccagcacGG" 
tcatcgcc" 
gcctgtccGC 
gtgtcaacat 
agccca:cag 
agcccgcgct 



SO 
'120 
130 
240 
300 
3 SO 
420 
430 
540 
600 

sso 

720 
730 

a4Q 

900 
960 
1020 
1030 
1140 
1200 
12SG 
1320 
1330 
1440 
15QQ 
1560 
1S2Q 
1680 
1740 
1SQ0 



4 



m 



gttcgcccgc cccaccacgg aCaacatctic c^acggcc-g cccaccgcgc ccttcgagat: 
ggtggtggag gccgcacaga aggccaaegc ccacggcccc accatggaac nccaggacgg 
c-acagcaca gagacagggg agaagggcgc ccagccgrca gg-ggccaga agcagcgggc 
ggcacggccc ggg.ccctgg- gcggaacccc ccagzccZca tcctggacga agccaccagc 
gccttggacg ccgagagcga g-a.tczga.tc cagcaggcca cccacggcaa cctgcagaag 
cacaccg-ac ccaccatccc gcaccggcug agcaccgtgg agcacgcgca cctcactgzg 2130 

2220 
223G 



1350 
1920 
1930 
2Q40 
21Q0 



gUgcUggaca agggccgcgt: agcgcagcag ggcacccacc agcagccgcc ggcccagggc 
ggcctctacg ccaagccggt: gcagcggcag atgctggggc tccagcccgc cgcagacccc 
acagctggcc acaacgagcc cgtagccaac ggcagtcaca aggcctgacg gggggcccc- 22 40 

gcttctcccg gcggggcaga ggacccggcg ccZgcczggc agaCg-gccc acggaggccc 24Q0 
ccagctgccc Zccgagccca. ggcctgcagc actgaaagac gacc-cgccaC gtcccacgga 
Zcaccgczzc czgcaccccg cccctggtcc ctgccccacc cccagggcac zcctzacccc 
tgctgcccng agccaacgcc cccacggacc tccctagcct cc-aagcaaa ggcagagczg 
ccttttnaaa cctaggcct:- accaggg-ct: tcaccgcttg gtccgaggca ccccagtcaa 
ctcccagatt tcaaaaacc- tttcctaact gggagnaa^g gcgggcaccc Ccaccaagac 2700 
gttctagaaa cttc-gagcc aggagi:gaac ggcccZZccz cagnagccng ggggaegccc 27 50 

agagactaag ccccccccct ctacccctcc agagaagggg cttccccgtc ccggagggag 2320 
' acacggggaa cgggatctcc cgzczcZccc tcntgccagc tccg.gagcc cggccagggc 
gggttagggag cgtggagggc acccgtctgc caCcgcccgc tgccaaccta agccag-cCc 
accgtgaacc acacgaaacc ccaactgggg gagcgagggg ct:ggccaggt ctggaggggc 
ctcaggggcg cccccagccc ggcacccagc gccctcgccc cccgtccacc cacccctgg.? 
tagcagcctc cctccccaca cccgcccctg tgccctgccg tctggaggcc acgtggatg- 3120 
O tcatgagacg cauEcccccc cgtcc-cggc ggacgggatg g-ggcaaagc ccaggatc-g 3130 
\M gctccgccag aggccgcaac acgctrgagag aacccggtca acaaag-gta ccacc;c::a 3 240 

JE cccctaaaaa aaaaaaaaaa aaaaaaaaaa aaagggcggc cgctctaga 3 23 9 



2450 
2320 
2S30 
2540 



2830 
2940 
3QQ0 
3050 



<210> 12 
<211> 2342 
<212> DNA 
<213> Homo sapiens 

<400> 12 

ggcacgagct cagatctcCc ccggtacccc tzccccacqc cctcagaCaa tccatctcaa 

ttcctcatgc caactgagga gctacggccg caaggcacc; cccaggancc cacacctaca 

caaazctccz zzzzczcczz ZZgccZZczc tgc-cacggg acaccccgag tccccacccc 130 

caatcac-ga cagc'gggcc ccctccaica gccccacaca ccacgtacta agtcagtcrac 

aaickcccc; ctcainczaac cgccggatcc gzctzzctac acacacccaa tgatccacgg 

ctcctccggc tgamc-accc acatiggacac agnccaacgt: gagcccacac accCcttacc 

tggctatacc actctcggcC gacccacccc cgaaamcggc tcataacccg gaaactcagc 

taaacacccc tccttcaaac Ccagccctcc cagcatgg-c tcaggcaggg ccgtcgccc- 480 

ccttcaccta aiagcccc-g gaccaacaic catacaaacc aacacagcac cacccaaaoc 540 

accaatatgg gggtaccact tccactcaaa cctccctcaC aiccccaccc cccccatgtc 600 

tcagccgaac ctaccctaac ccagcccacg ccacaacggc gggacaggtc ccccagtccc 
tatgcggnct tat 



SO 
120 



240 
3 00 
3 50 
420 



550 



xctacc cctgcacccc ccgtagacca zca.azzcza.c accccaatCa 720 



caaaatca:a cccacctc^g cctggcagaa ggtgccacgc tcecccggcc cgcctaccac 730 

ccacaca:cc ctacacctca ccaccggaic ccct:cccc: ccc^ccaic caacccctgg 840 

czzccccgcz gccaaccctg ccccccacgc ccccag-cca aagg~gcccc ctggaaaaaa 9Q0 

tgtaacaacc cccucacccg ugactggcac ctgacagcoa ccacaccggg gcagcaatgg 950 

ccaacgg-tg acaaagacaa ZzZcZZZCZC cccccaaaac caaacagcct tcatcaactc 1020 

cctagccaag accccctacc aggcccscac aggcgccgcc ctggccggca gtcacccaac 1030 

ttgggaaaac gaaaacaccc caccatggta cctacccica cctacaactc tgctgccacc 1140 

ccca.gzzczc tttcgcg-ga Cacaaccga" accckgccca ccagccaacc ggtcaggaac 1200 

ttgcaccccg c-ccntcagg ctccaaccat caacacccca ccccctaacc aaactactct 1250 

aacccctgca gaagccccta zczcczcztc acccacaaga aacaaacggg czcza.cz.zcz 1320 

caccaccccg ctaacaggac taggcaccac tgccgcaccn ggcactggaa tagcaggca". 1330 

aaccacccca a:caccncat accaaacacc acccacaacc ccttccaaca ccgcagaaga 1440 

catgcacact tccactacca gtctccaacg acaattagac zzcczcqzgg gagzcatccZ 1500 

tcaaaaccgg agag-cctgg acccccnaac caccgagaaa gggggcacct gcacataccz 1550 



ccaggaagaa cgccgcttcm g-gt:t:aaCga acccggcatc gczcacatcg cag-ccg-ag 1520 
gctccacgac agggctgcag agc~ccgaca tcaagrcgcc gactcc-ggc ggcaaggacc 1S3Q 
aiiccctcc-a agatggacac cctgggnngc cccccccctia ggacccc~ga tccucccccc 1740 
cctgttacca acgatcgggc cacgcacacc caaccctg-a ccccgcctca tcccccaaag L300 
gctgaaCCcrc t,c~acccagg caagcatgca aaaacacatt gaeaatatac t~cacccccg 
ccacgtctaa taccagagcc tacgaggaaa ccattcggaa gcuccagaac ccaggcccca 
atcacaacgc ccctiatccag caggaagcag ccagatgatc aacgacgccc tctctccutt 1930 

2040 
2100 



<21Q> 14 

<21i> 2027 

<212> DNA 

<213> Hcmo sapiens 



1350 
1920 



2340 
2342 



ttacacnaaa gcaagaaaca agaatgttag cccaaac-gc actatc-cgc agacccctac 

catttcacaa actgg-caga gcggaaaact ccaccagggc czgagc^gcg agaaacaccc 

tgtcaggcag g-cccaggcc taaccccugg ccgcactaaa cccccccac- atcagcagcc 2 ISO 

aaacacaccc: cccccacccc atccccacaa caaccccaga cctctcccgc ccgggactgc 2220 

aactggtcca gcctg-aagc gggaaggggg c-ctggcact: agctggtacc ccccccccgc 2 23 0 

aggtcctscn cccaacaaac ccg-g-cgcc attgaaaaaa aaaaaaaaaa aaaaaaaccc 

ga 

<21Q> 13 
<211> 1556 
<212> DN& 
s . <213> Homo saoiaiis 

p <4G0> 13 

Cl ggcggagtcc gcacctccag c~cgggccga tgtggaagcz ttggagagct gaagagggcg SO 

U1 cggcggcgc- cggcggcgcg ctczccccgc cgctc--cgc gccaggggcc cgccagcngc 12: 

£ tgaagcagag gcggccgacg ggcutccccc cggggccgcc ggggc^gcca ettatcggca 

Jk acatccatcc cctggcagcc Ccaeccgagc Ctccccacg- ccacatgaga aagcagagcc 



130 
240 



360 
420 



2 aggtgtacgg agagaccctc agcctaga-c ttggaggcac atcaaccgng gt:ccnaaacg 30Q 
zi gctiatiaatgc agtaaaggaa tgccC-g^Cc aicaaagcga aactc~"gca gacagaccat 
m gccttccccc actcatgaag acgacaaaaa Cgggaggcuc acccaattcc agaiatggcc 
s gaggatggg- tgatxacaga cgati-agccg caaacagtt::: ccgatatccc gga-atggcc 4a C 

O aaaag-G-i.C tgaatccaaa acctcggaag aaaccaaats t;;caatgac gcnac-gaaa -540 
•Li catacaaacg tagacctintn gactccaaac ag-caataac gaatgctgcc tcaaacataa S0Q 
m ccaatctgat catttttgga gaacgatcca cttacgaaga caccgacct- cagcacatga SS0 
IS tcgagttaiC Cagtgaaaac g^ggaactag ccgccag^gc ctcagtctcc tcgtiacaacg 720 
cctctccacg gat-ggcatc ccgccccccg gaaaacatica acagccgtcc agaaatgcag 730 
5 ctgcag-Gca tgattttctc tccagaccca icgaaaaagc ttcagtcaac agaaagcc-c 340 
iJ' agcsacctca gca&- czgnc gatgc-eatc caga-gagac ggatcaaggt: aaaaatgacc 
caCcatcCac cctctccaaa gaaaacctaa ttcccccagn gggcgaaccc accaccgctg 
gaactgaaac tacaaccaat g-gccacggc gggcgaccct CCtcatggcc cc'tacccca 1020 
acactcaagg acaagtecag aaagagaccg acecaactat gggccccaac gggaagcccc 1030 
cctgggacga caaacgcaaa atgcctcaca ccgaggcagt: tccgcacgaa gccccaagac 1140 
Cccgtaa-ac ag-Cccatta gggatccccc acgcaacccc cgaagatgca g-tgt:acg-g 
gtCatcccat tcccaaaggc acaacagtaa tcacaaatct tcatcctgea cactttgacg 
aaaagtactg gagagaccca gaagcgtecc aCcccgagcg acctctggac agcagtggac 12 2 0 

atcctgccaa gaaggaagcc ccggticcccc ttzccczagg aagaagacat tg~ct^ggag 
aacacccggc tcggacggaa acgrcctcgt tctccacagc attgct:ccag aggtincacc 
tgca;c;:cc acacgaac:a g-ccccagatc cgaagcccag grtaggcatg acattgcagc 



900 
9 SO 



1200 
12S0 



1330 
1440 
15Q0 



cccaaccctia cctcatctigt gcrgaaagac gctgaaaccg cctgggacgt cctcgggaac 1SS0 
aagaacgtac attcgcctta tccccgaact Cggcccaatc aaancaatgt gug-accaga 
ataaaagizca cagcaccaaa aagrocaaaaa aaaaaaaaaa aaaaaa 



1S20 
1€S5 



<22.0> 

<221> SITH 



6 



<222> (294) 

<223> n equals a,C,g, or c 



<22Q> 

<221> SITS 
<222> (1975) 

<223> a equals a,t,g, or c 
<220> 

<221> SITS 
<222> (1981) 

<223> c equals a,t,g, ar c 
<22Q> 

<221> SITS 
<222> (1385) 

<223> a equals a,t,g, or c 



<22Q> 

M= <221> SITS 

Q <222> (2021) 

ft <223> n equals a,t.g, or c 



<4Q0> 14 

ggcacgagnc gggagcagct ctgcgtgcgg ggcctcagag aacgaggccg gcg-ccgccc SO 

tgtgcctccc ctggcaggcg ctctggcccg ggccgggcgg cggcgaacac cccactgccg 120 

accgtgctgg c-gctcggcc "cgggggccu gctacagcct gcaccacgcc accacgaagc 130 

ggcaggcggc cgaggaggcc cgcaccccgc gaggtggggc gctcagcacc grgcgrgcgg 240 

gcgccgagc- gcgcgccgtg ctcgcgctcc tgcgggcagg cccagggccc gganggggcc 300 

ccaaagacct gccgttctgg gtcgcactgg agcgcaggcg tccccaotgc aaicccggaga 3 50 

acgagcctet gcgggg-Sc-c tccCggcCg" cctccgaccc cggcggtctc gaaagcgaca 420 

cgctgcageg gg-ggaggag ccccaacgcc cctgcaccgc gcggagacgg gtacteccag 430 

gccaccggcg ggg~cgagcc cgcagcngga aggagacgcg aigccacctg ygcgccaacg 540 

ccaccngcgc aagcaccagt ttgaggtcte gtgtcccgcg ccgcgccccg gggccgcccc 600 

taacccgagc tatcgcgcgc ccccccagct gcacagcgcc gctctggacc tcagrccacc 660 

tgggaccgag gtgagtgcgc tctgccgggg acagcccccg atcccag-ca cttgcaCcgc 720 

ggacgaaaCc ggcgctcgyt. gggacaaacc ytcgggcgac gtgttgtgcc cctgccccgg 730 

gaggtacccc cgtgctggca aacgcgcaga gctccccaac tgcctagacg acttgggagg 340 

ccttgcc-gc gaatgtgcta cgggcttcga gc-ggggaag gacggccgct c-cgtgcgac 900 

cagcggggaa ggacagccga ccctCggggg gaccggggtg cccaccaggc gcccgccggc 950 

cactgcaacc agccccgcgc cgcagagaac a;ggccaacc agggtcgacg agaagccggg 1020 

agagacacca cctgtccctg aacaagacaa ttcagtaaca tctaczcczg agattcc:cg 10 30 

a£ggggaeca cagagcacga Cgtctacccc tcaaacig-cc ccccaagccg agtcaaaggc 1140 

cactatcacc ccaccaggga gcgtgaCtCc caagttcaa; tctacgacct: cc-ccgccac 120 0 

tccccaggct tccgac-cct cccccgccgt ggcctccata tccgcgagca cagcagtagc 1260 

agtgttggcg accttgacca cgacag-act ggggcccgxic aagcnctgc- tccacgaaag 1320 

cccctcttcc cagccaagga aggag-ctac gggcccgccg ggctiggagag tgaccctgaa 13 3Q 

gcccgctgcc ttgggcccca gttctgcaca ctzgcacaaac aacgggg~ga aagccgggga 1440 

cngcgancCg cgggacagag cagagggtgc cttgccggcg gagtcccoCc icggc-ccag 150 Q 

tgacgcatag ggaaacaggg gacatgggca ctcctgtgaa cagtct^tca ctccigatga 15S0 

aacggggaac caagaggaac Ctacccgtgc aaccgacaat ttccgcagaa atcccccttc 1S20 

ctccaaatcc cctt-actcc actgaggagc caaaccagaa ctgcacactc cccccctgai: 1S3G 

gatagaggaa gtggaagtgc ctctaggatg gtgacaccgg gggaccgggr. agcgccgggg 1740 

agagacaccc CcccasgzcC aCccggagaa cC^ggagaag cgatcgaacc CCCcaagaca 13QQ 

tcggaaacaa atagaacaca acataattta cattaaaaaa taat~cc~ac caaaaCggaa 13S0 

aggaaatgtt ctatgttgtc caggczagga g-atiatcggc tcgaaaticcc agggaaaaaa 1320 

acaaaaacaa aaaaccaaag gatcgccgac aaaaaaaaaa aaaaagggcg gccgcxiccag 133 0 

ngggnccaag cttcacgzac gcgggcatgc gacg-caagc acctcca 2027 



7 



<21Q> 15 

<211> 2334 

<212> DMA 

<213> Homo sapiens 



<220> 

<221> SITS 
<222> (2273) 
<223> n. equals a, 



t,g. or c 



<22Q> 

<22I> SITE 
<222> (2290) 

<223> - equals a,t,g, or c 



<4QQ> 15 

gggagcgtgg 

ggacgaggcc 

gccctgcaca 

ctgctgagcc 

ccaggccccc 

acctacgctc 

cgggccctga 

aggcaggtLt!; 

accagcctgg 

ccatgccaga 

cgccgcacgt 

gacacccaga 

ctccccatcc 

tgcaacacac 

agcgggccga 

gtiagactcac 

cccgtgatct 

gacgacaccc 

ggagcaatac 

atcaaggccg 

aaatcccctc 

agtcg-ggct 

ttcagacaag 

ttcccggaizg 

cagagtctga 

ccagccaag- 

gccaccttcc 

ctgcagcaag 

aaacaaadg" 

cctcgagggg 

cgtaCtataa 

cccggga-ct: 

acagaganc" 

gctgcatatg 

acacaggagc 

ttcaggcccc 

gaggtttiac 

aacgcaaccg 

caccacaaan 



ccgcagaacc 

ggggcctcct 

acacgcagcc 

cgccgccccC 

ccagagccct 

ccCcgaccac 

cgcgggactt 

accagaaagg 

acaggct-ag 

cccaggaccg 

gtgcc-ccca 

aattggccug 

Cacgnaccc" 

cctgggcaga 

ctgactctgg 

cccatgcctc 

tcccccaccc 

cgcagcctcn 

agcgcaaccc 

Ccattggazc 

aggggccgga 

ggagcgagga 

tggaaaaggn 

agcagctgag 

cttcaggcca 

ggtcagcctc 

cagtcctcat 

ccacagacac 

zczggacaca 

cccaagccca 

gctaggcacc 

c tgtgaagga 

aaagctccaa 

czzgczgczz 

cagtgaccct 

ccagtcctca 

tcgcctcaaa 

gtggcggCaa 

ctcccaaaca 



caggt:ggcag 

caaggaggcc 

c"ccggcccc 

gctgccgctg 

caccacgctg 

ac-cagtiagti 

cccgctcgtg 

gccacaggat 

agacggcctc 

ggatgccccg 

ttc"gagccg 

cctcaCcggi: 

ctacacgctg 

gagccccgcc 

tgagaaggcg 

agatgccgtg 

ggccgcccgg 

gaagaagaac 

accagccaat 

caagctcacc 

agacgtgtcc 

agagcttcag 

acaggaagaa 

cagttcccgc 

ggaacccact 

agagccctcc 

tcigcggctc 

cccacaaagc 

gaaaaaaaaa 

cgcgtgcatg 

ggccgtcgtit: 

accutacctc 

gg^aaatata 

gagagccccg 

aattg~ccgn 

cagtctgctc 

aaacctycca 

cc"gg~ caa: 

aagcact-" z 



ggcccccctc 

cuc-gactgc 

gagggucccg 

ccccagccig 

ggcgccccca 

cccagcaccc 

gacggccaca 

gccaacccgc 

gt:gggcgccc 

cgcctcaccc 

gagcttgtga 

gtagagggng 

ggagtgcgcc 

aagggcgtcc 

gtggcagaaa 

gcacggcggg 

ggtgtgtgca 

ggtggcgtcg 

gtgcccaccg 

gggaccggtg 

acacacccag 

ggtgtcatcc 

aacaaatggc 

cactccgacc 

gagattccca 

ccccacatgg 

tgacgaccca 

tcccccgtcc 

aaaaaaaaag 

cgacgtcata 

ttacaacgtc 

tgtggcg-ga 

aaattictCaa 

ctnactgagt 

gtatttiaga 

atgatcacaa 

cacctccccc 

ggagc" cata 

ccc-ggaacn 



aggcccctca 

C3.c<zctzzg<z<z 

gcacg^tcgg 

caacccgtgc 

gagcccacac 

agggccngca 

acgacc~gcc 

gcaaccticag 

agttctggtc 

tggagcagac 

cctcggctaa 

gccacccgcc 

acccgacgcC 

acucctccta 

tgaaccgccc 

ccc-ggaagt; 

acagtgcccg 

tgacggtgcc 

cggcagacca 

gagactatga 

tccwgacaga 

gtggaaacct 

aaagcccctc 

cctcacgcct 

tacaccggac 

ccccag-cct 

gccagtcctg 

gcaggcacaa 

ggcggccgct 

gctccccccc 

g-gaccggga 

caCaaGzgga 

gcg-acaacg 

acgac-Cacg 

tCGacagtcc 

Ccagccacac 

cgaacc-gaa 

atggcaccaa 

taaatggggg 



cCGG-gacGw 

tgcGtgccGg 

ccggcggccc 

ccacaccacg 

cacgccgggc 

agagcaggca 

cccggcGcta 

ctacggccag 

agcotacgcg 

cgacctca:a 

agctctgaac 

ggacaatagc 

cacccacacc 

caacaacatc 

gggcacgatg 

gzcacaggca 

gaatgctccc 

tctgtcca .g 

cCtcgaccac 

tggggccggc 

ggagctgccg 

gctgcgggnc 

ggaggacaag 

gcgccagaga 

agGGaagtita 

tgcag—gtg 

ccagacgcca 

atatetcctg 

ccagaggatc 

tatagcgagt 

gacGzgccag 

caaaczaccc 

tg-taaacta 

aaaaca:ca; 

caaggctcac 

cacacccgca 

acacaaaacg 

caaagcantg 

zzgg 



SO 
12 Q 
L3Q 
240 
300 
3S0 
420 

4ao 

540 

so a 

SSO 
720 
73Q 
34Q 
90 0 
960 
1020 
1030 
1140 
1200 
12S0 
132Q 
1330 
144Q 
1500 
15S0 
1S20 
1S30 
1740 
1800 
13S0 
1920 
1930 
2040 
2100 
21SQ 
2220 
2230 
2334 



<21Q> IS 



3 



<21I> 2603 
<212> UNA 
<213> Heme sapiens 

<40Q> IS 

ccacgcgccc gcacagggcc caggctgggc ccagaggcca ctgagatgee agctccttga 
gagcageggg ggtg-cccag gecagagcag ccttctccctc tcctgg-ccc agaaaaaccc 
tcacagtaaa cggtggcctc tgggtggtca cccctagcgc aggccccctc cctcccccca 
gctcc-cccg gtgccctggg cgctgggcgc tggacggacg ggcgtcccct agcccccccc 
ggngccctag atgctgggtg ctggacgggt gggcgtcctc cagcccctcc cggtgccctg 300 
ggtgctgggt gctggacggg tgggcgccac ggtgcaccct tgtcgggccg cctgtgcccg 3 SO 

agtggcctct gcagctctta tgtccgcctc -aacgggatg tgcgccctga ctgcctcgtt 42 Q 

cttaagggca tagtgggtcg gctaagacc- gatcgecagg actgegttet gggcaggege 
tggcaaggce caaccaagcc gtccgtgccg ccagggagcc gagactgecg gaaagaagag 
eggcaggaga gggcgtggtc gtgcagcccc acccccggga ggggccccag geacegggaa 
g<raaagtcct gttggaggaa etcggtggct gtccacaccc gcctcgctgt cccggagtc- 
tgatctgtcg teggegggtc gccggcacag gactaaacat ggctgaggct gggccccagc ,20 
cagatccgcc tggggacagc accgcgtggg cccaggatcc acccagagca ggcaggcett 730 
gctgcrggccc cagtcaagtc cactsccagt gaggagagee ageegggagg tcagtgccag 
agctctgg-g gagcccagac ca^gca^tca ctgagggccg cccctgtcgc cgccctgggg 
tccctgtcct cctatcctga ctcc~gcccc agggccacca cccc;gaact g-geccuggg 
gcccccaccc cccacccggc cgactccacc cctgggcctg ccag-ccacc tggg-cccct 
ctggcccctg atgccnggac cctccg-gcc aagcaccacc aagcaaaggg gacgaccttg 
gccaccccca aagtgctgag actacagcca gtgagcccct gccgggggcc aaagecaegg 
agggtgcctg gaccctccca acctggaagg aggaggggag agagcaggca gcagggcagg 
gggaagagga ggagtgcccg atetgeacag agccctacgg geccagagag cgccgcctgg 
ccc^gctgaa ccgtagccac ggcctgcgcg tgggc-gect gcacaggctg ctgggctcgg 
cctccagtgc ccacccgggc egggtgeget gcccgctgcg ccgcagaaga cgcccg-gct 
ggagtgggag atccgccggc tacaggagga getgetacag gccgacgggc cctcacgcca 
gccccgccga gaggcccctg ca-cctacca ccgcaaccct gggccctggg gctccccgga 
gcaccgccac cagc-gcgcc ccctggcagg gcccg-gggc ggccggggcc gcctgccc:: 
cctgccctgt ccaccctgcc tggg-gcccg gctc-ggacc ctgegggage ggggaccccg 
tgcccgccgc ccggccctcrc tgagcctgcc ggeccttgag cccctggggc tgetge-gge 
cttcacgccg ctcctgctgc tgggaccget ccccgtgccc ctggaccgct ccggocgctg -743 
agcagagccc aggacagccc cgccgcaaca ggccaggggg cccagactgg cccacgcccc 1300 
catgcctggg tgctgtgagg cctgatgacc aggctggaaa aacccaaggt egggtccagg 1360 
gcagtggcct tcaatcaaga cctcccatcg ctgaacccac aaccagggct acccagaggc 
ctgaccctgc agagtccacg gctgcactgc cgcccagaca ctagctgaac ccaaggacac 
' cagcgcccaa ggacagcccc tggaggaggc cagcccagca ggaaactctg cgagcaggac 
cccattcacc ctgeggcaga cgggcaccgn actggccacg ggctgacgcc ggccacactc 
cccctccgag ggccagccga gcacagcagg cacgaaagca aacagagata cagcagegag 
tcagtccc-- ggagagggca gggactccgc ccaccctgtg tccagataag ggccagtgtg 
tgtccctgaa ggtcaggeca geegggggag gggtccacgc cgcgaaaacc cagcctgcaa ^ ^ 

aggctcctct ccccacttga ccaggcccag accaggtggg ggttggcgcc ggcctgtgtc 2->40 
gcaggggaca agggcccacc caggccccgg aacacaagct ctgcccctgc acaccctca- 
gtcaccacac ctgggatgga gacaccaggt ggcccagcga gagatggagg actgatcctg 
gaacgtgaag cagctttcaa taaaccagc: cctggggaaa aaaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaaa 



60 
120 
130 
240 



430 
540 
SOQ 

ssa 



900 
960 
1020 
1030 
1140 

12 00 
1260 

13 20 
13 30 
1440 
1500 
IS SO 
1520 
1S30 



1530 
2040 
2100 
21S0 
2220 
2230 



2460 
2520 
2530 
2SG3 



<210> 17 

<211> 1291 

<212> DNA 

<213> Heme sapiens 

<220> 

<221> SITE 
<222> (1279) 



9 



<223> a equals a,t,g, or c 



<22Q> 

<221> SITE 

<222> (1236) 

<223> n equals a,t,g, 



or c 



<220> 

<221> SITE 
<222> (129Q> 

<223> n equals a,t,g, or c 



<4QQ> 17 

aaacctctcc 

gcccacgcgc 

catccaagcc 

gcagcnccac 

c~caggg~gc 

g-caaccggg 

actccaaaga 

t-gg~"-gg 

cct:cc:ac-gc 

gctctctccc 

gaacgaacac 

tgtgcatcaa 

cagccacgc" 

tcgccaacca 

aaagcaaccc 

ctaacgcccc 

aaaactzcctt 

gtctgagatc 

tcctacacca 

tctgcaaccc 

gccattagaa 

aggaCccaag 



Cacaggtaaa 

ccggaaagag 

aacaagcca" 

ggcccctgga 

tcagcgcgcc 

Ccaaggaaac 

agaagcaaag 

aattg-c-c-g 

tgti&ac-ggc 

tgtgccagca 

cgkcagTicct 

tggggtagcc 

gatgacc-cc 

agcaaacacc 

tgtgacacca 

taaaagaaaa 

aagaagacg^ 

cgcicecagg 

ccaccactac 

tct^aaagct 

acaaaaaaaa 

cccacgcang 



gctggwacgc 

gaaacacaga 

gctgaagnaa 

tcscaacagc 

cagccccacg' 

atacaaacga 

gcaccagggg 

tgtc-aatac 

ggaeacccat 

tccaaggagc 

atcccggccc 

ggccaagacc 

tcccucctgg 

acaaccaata 

gcgtcctctt 

aggggtacca 

cccttaCcgc 

c-ggtcgcca 

atgccggcaa 

agaaacgctt 

aaaaaaaaaa 

cgcgcnigcn 



ctigcaggcac 
ggtgccaaag 
acgaaaccac 
ccctgggccc 
gca-cacacc 
taaacccaag 
tgatccagat: 
ccttctc'CC 
Cc^gggg-gg 
tttcccgticg 
ccaCiggagc 
actgggccgc 
agttcC-cgc 
Cgtccg-ccc 
cagctccccc 
gtczaacccc 
ccacaacgac 
a-ga£ggc~g 
aggtgaagga 
ccgttcatat 
aaaaaaaaaa 
a 



cggcccggaa 
gaacaaagac 
acccaaccc: 
aaccaactta 
cccgggaacc 
cgtgggaaca 
cacggitgga 
cagagaagca 
cccccctittic 
cctggcgaaa 
gacuctgcng 
gctttctgga 
agcccgcgcc 
ggttaCUcca 
cagacgcaac 
a-ggagaaaa 
ttctagtccc 
tacccccccc 
tcagaggact 
tact;t"ccc 
aaaaagggcg 



t~cccggg"Cc 
acaacgatgr 
Cacccaccaa 
gaaaaccaag 
— tgc^agca 
gcagcaacga 
tcga^gcaca 
tcaggv.uccg 
actacctctg 
ggcagcc*:gg 
ccggtggaca 
aaaggcaciz 
acagcccacc 
aatacgtatg 
aactacccag 
accacc;gca 
caaaaaccgs: 
cactg^c-cc 
gaaaaacgac 
t"aacaaaa~ 
gccgccccag 



60 
12 0 
130 
240 
300 
3 SO 
42 0 
430 
= 40 
S0 0 
650 
720 
780 
840 
900 
9SQ 
1020 
1030 
1140 
1200 
12S0 
1291 



<210> 13 
<211> 3129 
<212> DNA 

<213> Homo sapiens 



<4Q0> 13 

cggcacgagg 

cgttcctctc 

tgcagcgcca 

Cdggagccac 

acagggctct 

iaagctccac 

cctccccccc 

caacicccca 

agaicticggc 

tticcctcna 

ttnaccccaa 

tgattctcza 

ttcccnccca 

gagga-aaag 

tccaccatcc 

ccctccaaag 



ggagaccgaa 

agggccgccg 

gt-cctcggc 

caagagtcgc 

g-gccagact 

agg~atccca 

cccccctccc 

cg~ caaattc 

tgatgcaaac 

gccaccaaag 

gccattgaaa- 

aaccaggacc 

cccncgttag 

taaatagtaa 

t cg'cgcaag'C 

taaaccagaa 



cggcggggcg 
cagcaa:ggc 
tcccctcctg 
cagticagtcc 
ttcagtcccc 
gctti-cccg 
totcacttic" 
tctccg'ccta 
gccgcagaca 
ggacctaaca 
acatttutcg 
ctcaccagac 
cceccg-ang 
atggtiaCgac 
cggcagggag 
gc^ucagg-ca 



aggggagcta 
actccacccc 
cactcccgga 
cccctccg^g 
gacaactccg 
caaccitcca 
tacu^gtacc 
aacaactaac 
accaaccgat 
taattgaacg 
tgtgat~ata 
tatttcacag 
cctg^cctgc 
agtggccaaa 
gcccactaat 
aaaagcagag 



ctigcgtccgg 
ggcagcagcc 
aCcag- tzca 
gaaggccgag 
aaacccttcc 
tcaccaggcc 
cccagtGG — c 
gggccctccg 
caatcaa-ca 
accacaaaat 
agatacgtaa 
cttactcact 
aicctaaacc 
accctgaacc 
tcagcctccc 
aagccaccca 



cttcgcaggc 
acttgccggt: 
ccactcictc 
tcccaacccc 
ttccgtttcc 
acaccagcgt 
caacaactaa 
:attccaact 
gcaaacccaa 
cctgatcsca 
tcciaaaaaa 
cactgaccgc 
aaggcatgag 
tctntcccta 
cattccccgc 
agttctitcc 



SO 
120 
L30 
240 
300 
360 
420 
430 
540 
6QQ 
650 
720 
730 
340 
9Q0 
S60 



10 



atttatu-ta icaccatact; aacaatttcg ctt-cngaaa tgattacccc tacaagcagc 1020 

aaatacazct cctaggaagt: tagaattac" gatgccacg- e—cacatcc agnatcccaa 1QSQ 

gtaaacacca ggcaaaccca ggaaaacaaa tacaaccaiiu ccctcggtta accgucct-a 1140 

aatttaaCat acgtccacct acnccccraa acatacatct: cataaacggc gactacgtac 12Q0 

tccacgacat aaaaactaag tacttttagc gggaaatcct: tcg-ggaacc accgaaatga 12 SO 

taactagcgn tagcaacact cccgtttcat: gagacgaguc acttcacgaa gaccgcaaac 13 20 

tateetecgt cccctccaga guccaccuca tgggaaactc aaaccccact gTzcccaacut: 13 30 

taacgtcaaa cattccc-cc acat:acgg~a tzggccagaca tgctuccctc tctgcngcua 1440 

tttcacaect tagacgcag- ccac-cagga fccti-cce-ct tCuCC-ttag gagticagagc 1500 

tcgaag-gat gcaacactct; caccctttac ctctgaccca cacgctaaca aagacaactg 1550 

atagacaata cgacacaccc tiggccctaga agazccttiaa tgaggaaa" ctgtggcctg 1520 

tettcccttc tgtgctaect -tugccuccn aaggcaggca tc tttatcag ttcagtrtgt 1530 

tgcugngact agctcaaact tgzccagggc tccatgcc-g aaataagtaa tcaattagcc 1740 

taCatataca aatatataac caaccaccac aaccacctaa gaacgccagg gagtcgngaa 1300 

gt:ggaaacaa attaaagagu cgcaaaatCa aaagagctcg ccccccaaca gaagcgtigct: 13 50 

gacatacatt aaagaataaa tgaatcaaag tcaccaaaga ggaacaggag ctaaaaagaa 1920 

gaccaccg^c tagacggagc caggccgg-g cactgccaag gaaagcaaac agga'ccccg 1S30 

aaggcctggt: a-gaacitca gcctgcatca -cccgccgga gggggtgacc cccccaatac 2040 

cacgteagac ccacagc-cg tcct-gacag agcggcagna cctticccccc accgcactcc 2100 

caccaccagc aeagang-aa aacacagcaa gtacucagaa accacccgaa gagrgcagcc 2150 

accagtagag acgaccgaaa cacctgtctt tccagggaat acctctgcct tccccctncc 2220 

agaacccccc ggttacaatg tgcccaccgc caggtcacca gccacaaaac accacg-aga 2230 

ggttaccggt: catcatccca acaciacccaE caaaaaatct ggagtacatg aaac-gccct 23 40 

ccaccgtaac atcagacaaa aacacctacc ctatcaaata cagacccaaa acngccacca 2400 

aatcggaaga atadgacctt aaacccaaaa aaaccccaca cactnaaaca caacaa^cca 24S0 

tctttaccta cct-cctaaa caccaacgca tgagactcct cc-aaCacca tgaatacgca 2320 

ctacgactt: tacgctacac ctcttuctcg ttitccaccca tzgctagtgaa acttactagt 2530 

acccttcacc gaaacaccac cacttcccac: ggaggaaaat atcttatccc ccccacgaaa 2540 

agggtgctcc acccccgaaa cacagaattc actctgcccc ggactctgte cgtttgtc^z 2700 

cacatcaacc ccaetccaaa cgc;cac-ca aaa"actc- tcagac-Tiga guccaggtga 27SQ 

aaa£g-"aac ggaatcaaaa agnagtcagt: caaacgaaat: tccaacatat aagtcaaatc 2320 

tgaagaaaac cgaatcaaca aaggattcta agcctatraaa gaaaecaaaa tcacgctccc 2330 

aaaaaiatca Cgtaactgga gaaagcaatt ttacttcgag auct.cc.cacg aticcttaat 2940 

alacacttt; ectcatuagt: acgcaaccti tcggcaacat acacaggaga cagccaattt 3000 

Lggcaagac- aca:aa::ca tacatccctc aCcttgacat acatgatcaa acacctcata 3Q50 

acgcccacca qcctzqczcz tgtccactgc aacggaaaaa aaaaaaaaaa aactcgaggg 3120 
ggggcccgg 



<21Q> 19 

<2il> 3529 

<212> DMA 

<213> Hcma sapiens 



<4Q0> 19 

ccacgcg-cc gccagaggcc gggaaggaaa aacfcactccc accttaaaat accttaaaaa SO 

tacgtacaga cctiactgcaa a-caaaacce cccccacaac cacagcccct ccccacacat 120 

aaaattcccc ctaaaccagg aacataacg-c acagcctgtc cttaactigt accC-cccac 130 

tcacaccatg tcacaaaaaa atccccatgg cgacccacga tcccacaggt: cctgnccacg 240 

tgtcccccga gaagacagtc ccggaacgga accaccaas; caaaaggcat -izctaaggcc 3Q0 

gttacacata ccgccaaatc ggcctt:cagg aggatgacac aaatcccacc caCgticaccc 3SQ 

gagcgtgcca c-acccgccg ctaCtgcccc acgtccgagg tcccgtccct cctcctcar. 420 

ctcacgggca actgtcaaca tcaccaccac Cgtgatcctg aaaagaaagg cccagaagac 43 Q 

acagacag-c tgggaccnct: ctttczccct accccccctt gcactg~agg gaaggagtga 54Q 

gatgtggctg caggacCggg gaacagccaa gccagaaacg agcagcagtg caaccgtggc 600 

aaccccggag gggggcccca gccacgaaag aaaacagggg gtgccccgcc agctggaggc 650 

tgcagaacct -agcctgggc gtiiiCugcccc ctccgtggaa ccgacagatg acaggaatga 720 

cgtggag-cat: tctcatcaag cgcctcagca cag-ccagtct ccgagatggc aaac~tcagg 730 

caaagacacc tgggctgaaa ggccccacac --aaaacttg tcagTicagat aactgcccac 340 



II 



caccaccccc Cgcaagacac ctagatatgg gaaccccaca ctcttcccca ataagactc- 900 

tgtacgactg aaagac~ctg tggccaaaaa canaccttaa tccaacccgt ac--cccctg 96Q 

c£gcaac-tc agccttCtcc eatc"£cca cgg-gaatga acaccacaca ggcndccaga 1020 

tcagacaact Cctggctccg Ccac"uacta gc-gcaccag tnaacaccag ctgctgaczz 1030 

cacttocgca tatgcacccc aaaaccagcg ccccaaaaca acaac-actt actactcccc 1140 

acaagcc-aa aagncggctg aatgg~cctg ctgatctgac ttgg-ccaacc ctggccggac 1200 

actgcc-gat tcagcacagg CCgaccgtcc "ccgccccat gcctcccacc acccacaaa: 12SQ 

tgcccaggcc tgct-gcctg aaggcagaag ag-zcccac- agcaggagag ggcaagcacc 13 20 

tetcaatgca caagcacctc tcaagcctct atc-gcatca agcttgctaa ng-cccgczg 1380 

accaaagcaa atcazgcggc ctacntcgga g~cag-gngg gcaggcacca cccaaagaaa 1440 

gtggc-acag ggaagcaega agacctagag ccacnagcac aaccaatcna ccacacnaac 1500 

tttgtgacct tgggcaaCcc cctiagcctc cctgagctcz aactccacaa tctacaaaag 1550 

igcagaactu ttcngggact. ctctgctaca acgta^gtaa aatgtc-gct acacaataaa 1520 

cacgccacat tCatcaaCtc tctc;"ctcc cccnc-Ccati catttgaa" CCC-cctaaa 1S3Q 

cgcatacaca tatgngcata cacacacgt:c caccrcgaca Caeatacatg cacaCaCaLa 1740 

tatatacacg acttaccccg tcacccaggc gagagzgcag tggcacgatc acagcccacc 13QQ 

gcggcctcga ccccccaggc ccgagngacc g-cccacctc agcctcccag gtagccggaa 13 SO 

ccacagacgt gcgccaccac acccagctaa a£";:cccgt: a£accii"aca gagaagggcc 1320 

CcactaCgtt gctgaggccg gtcctcaact cccgggttca agcaatccac ccaccccgac 1930 

cCcgcaaagc actgggacta caggcacgag ccaccatgcc tggccc-caa acgcaccati: 2040 

cttatacatt cctciacaca cacaaagctc gaagazaaca acaacaaaca tcacucccac 2100 

ggcteccgtc gaatccccaa cccctagcct girgcc-aacc aagtgcgccc Ccaactcacc 2160 

ccaatggnce ggaacccgaa gactaaagga accaaagggt acctctcccc tccg-tcccg 2220 

ggaccatcaa g-cg-gaaac gacctcctgg ccucccgacc Cgggcccact: accacaaatg 2230 

aaacagaaag gag~caacac atctgcatca actgaigacc c'caacagag gcaacaaaat 2340 

gagaaacccc cas-ccccaa cttaaatgcg gc~"gggga taggataagt a;cc-ggccg 2400 

ctcct-agcc aaccaggccc cagaagggac accaactgca gccagcgcac cctgctgggg 2460 

agacccaagg c-gccagccc cctcgcCgag aatggaacag aa:ti:'aatt ggccgtgaaa 2520 

gcaecatggc aggaggcaca gggaagagtt ggtocgcccc cccaaagcag g-tagggaag 2580 

ctccaaggga aaggacEcaa acaggcgcac ggcagcaaga acctcccacg cticaggggcc 2640 

ttcccagctg caacaccaca gtcccaattc cagggctgga ccccagccca ggtgaaggac 27QQ 

cccctcacac ctgggcgacc agnacCggag cacggagcct gcgaatcucc tcatctccgn 27SQ 

gcccccccca acicccacga gagaagcacc acacagttac tg-cactcgt a~tg~caatc 2320 

tEcgacccca gagaggtccc caaacataag cccaactrccc cagcgtactg Cgngrgtgcc 233Q 

aticaguccat: gngtatggnc tgaagcaacg gcacagcacc cccgcaccg-c tgcticagag 2940 

caaCgtgaac C CatCCcacc cacaaeaaac taacccaacc acgcctaggg caaag-Cgtc 3QQQ 

ctcatggtaa cgag^a"g~c c:c;ctgcca Sggcaacttc Cggaacacac acacagacaa 3060 

gaatgacatg tgaagg~caa tagatgagac Cacgaaaaac aagacaacat agctccccag 3120 

cataaacgta caatgacgca cgzggcc-Cu ggagacizgtc ggacaaatca cctaaccuc" 3180 

cggcgcctca gccctciaac eatcaaaa-g agggcaacag taccgc-gat . ccaa-aaggc 3240 

tttcggaaag aataatgaga cgcaaagagc ctagaatagt gcagagcctc aacaaacacg 33Q0 

Ccttcgtgac Cccttczagt: gaagegacat: agagaagtgg gaccagccac tgacccaccc 3360 

aagctctgtc aacctagcgg taCagtaaa- ccgctggcaa aaagcaaagc agaacaacaa 3 420 

aaaaacgcaa gcaaaaccac tgctttcagg aaactatti: gtgagacaga agctzgacaa 3480 

aaaczcaaaa tccgcagcaa actccacutc a:taatact; taaaacccao agcaacaaac 3540 

cattacttag aacaagtaag cagCCtCgct tccactgtgc actccgatac t:ac::i:caa 36Q0 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 3 623 



<210> 20 

<211> 1144 

<212> DNA 

<213> Homo sapiar.s 

<40Q> 2Q 

tcgacccacg cgtccgcgcc tgacctgctt 
caaaccgagc tctggagaac gccatcagcc 
catgggncga ctcgacggga aagtcaccac 
agcagccgcc etagc-ettg caagagaagg 



ccagacaaag gncgnc-caa g~ncg~cgc~ SO 

cgccgcctaa aaccaaacca caggtcccac 12Q 

cc-gacggcc gctgctcagg ggacciggcca 13 0 

tgccaaagtc atagccacag acaccaacga 24Q 
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gtccaaaciit caggaacngg aaaagtaccc ggg-attcaa actcgcgxcc tcga-gtcac 3QQ 

aaagaagaaa caaactgacc agtccgccaa Cgaagcigag agactcgacg ctctctc^aa 3 5Q 

CgtCgctggc tccgcccacc acggaact:gc cccggatCgn gaggagaaag accgggaccc 420 

ctcgacgaat cccaacgrgc gcaatgtacc tgacgatcaa ggcatCcct" cccaaaacgc 430 

tcgcccagaa aticcggcaat aecaccaaca cg-cctc-g- ggcttccagc gccaaaggag 540 

tigtigaacag acgtg-gcac agcacaacca aggcagccgc gatcggcc-c acaaaatcrg 60Q 

tggctgcaga tcccatccag cagggcatca ggtgcaactg cgtgtigccca ggaacagtcg 660 

atiacgccatc tctacaagaa agaatacaag ccagaggaaa tccrgaagag gcacggaacg 720 

atttcctgaa gagacaaaag acgggaagat tcgcaaccgc agaagaaata gccacgctct 73Q 

gcgtgcattc ggcttccgac gaacccgctc a"gt:aac~gg caaccctgtc accatngacg 340 

gaggccggag cttgtgatt" taggatcccc acgg^gggaa ggaaggcagg cccttcctac 9Q0 

ccacagtgaa cccggctacg aagaaaactc accaaccacc tccccccrgc caaccacatg 9 60 

ttaacgaaaa taagctcccc ccaatgaCgr cacngccugc aagagtcuga tccctzcaag- 1020 

atattaatc- ccccgcaatc tcctccgaaa ccatcgraaa gaaaCaaaaa tatcgaactc 1030 

acagcaggaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaact: 1140 



<210> 21 

<211> 1443 

<212> DNA 

<213> Komc sapiens 



<4Q0> 21 

ggaaccaccg gcciatatcg ggttggatct atcaetacga g"gccg~cg- ccttgcgcca SO 

ggaaacactzg tagggaag^a tggaacacga actCgccccg cctttccctc aagcatacca 120 

tatactcgtc ttcctg~c-g ggc-ggctcc agaatccaU accagccacc agaaaa::a: 13Q 

aatcaccccc caaaggccac tcaagaagcc caagcgaaag acccgctgag aagaccacct 240 

gatctaatgc Cgg-tgugtig tcccctcccg gcaacsggac 'tetgcccg-c cagagg—tg 3 00 

atcgctccgg aicgcccatc cgagctccgc cgactaiaca cgcaattcca agagcccta; 3 60 

ctaaaggacc c-gctgccea tcctaaaatc cagacgccgg cataracg~c ctiaccccgtt 420 

ccttacctcg igactgcact gcacggc-ca gcggcccccg gacgccccng gatgcccgac 430 

aicacattga cacatgccgg agg-ctggcc caggcccagt ccccccacac tggtgcatc- 540 

cttcatgcta gaaccgctta tgtctacaga gcccctgaag aagcaaaaac cctccctcca 600 

gcaccaaaca tagcacacgg agcccccccc cagczcctgg ccnaccgecg tatctacaaa 650 

ccagagttct tcacaaaaac aaaggcagaa gaaaaagcgg aacaaaaaca tcacttcacg 720 

ttcc^cctcc ccaaatcacc aact-ccgct atactggtac tgataccctg tcccatttca 730 

ctc-cttccc a&acg-gag- actcaagaac atgcacaccc :cgccc-gca ccgca-gtigT: 840 

gagctatacg gna£tg-g~a aattccttct gaaggaaaac ggaaazccct gagaaacagc 900 

ttgtttaaag aaaia:accc aaaacoattt gtgaataaac tcgatcaccc atcccaacat 9SQ 

cgtccgacat acaaaacaaC cacaagtgta aaaaattaca acccagngcc aacag-agrg 1020 

agcatgaaac gaaactatcc aaaagagaa- atggcctgcg cacattaaaa aatccaaaac 1030 

agtgaacgca gacnggagga gtaacctctg caaataagac gaacacgc-C catcactaaa 1140 

ctcaatacaa aaggcaaacc accagaacac "CCaaacgt tgtt-gaaaa atg~cccccc 120Q 

aaggaaagcc catcatncgc cgctgtc:ca agaaaa;tac ttccactaaa cctctttgcg 1260 

tgaatutaaa cagctaaata gggaccag-a actCca'ctc tacccttaac gaacatccgc 1320 

tctaccggcg gctggaaaca cttccaccgc acctctgcgt aizaccSucaa taaaatcat- 1330 

ttcggccccc taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaccc 1440 



<21Q> 22 

<2il> 1053 

<212> DNA 

<213> Homo sapier.3 

<40Q> 22 

cttcgctggc gggaagaacc cgagacggcg gcagccagcg ctggggcaac ccggc-gccc 60 
ctgctcctgc tgacgccggt: agcagcgccc agtcgagccc ggggcagcgg c-gccgggcc 12 0 
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gggaccgg-g 
gccaccccaa 
ccagcccc'C 
cagcagccgg 
ccccccgccc 
ccccc^accc 
tutgCGccag 
cctgcagcgc 
tcgagggtac 
gccaggggcc 
ctgcctggtg 
gactcgaagg 
ccaggctgg" 
tgtgncctcc 
cigcgggcct 
atcgaaaaaa 



cgcgaggggc 
gaagggcceg 
gtzcc-ctgcc 
cgtctcctgc 
anggagraga 
cgagcccacg 
gugg-cucac 
cccccccact 
cctgggtcec 
tctgggcccg 
caaacaagga 
aaaaggtcca 
cagcgucccg 
accacccccc 
tcgctgtgug 
aaaaaaaaaa 



Cggggcggaa 
ccggcccctc 
tcctgctggc 
gcccgcctcc 
gcccgagatc 
cagnccggga 
ccTZCCZ.q~ac 
ttgacactgg 
cccatcaggg 
acgcccagtg 
acccccgcct: 
agaaagaaaa 
tcttgcacac 
gccgcccagc 
ccaccctccc 
aaagggcggc 



ggtcgagagg 
ccctcccccc 
agaggagcag 
cccatggccc 
ccggccacca 
acacgccgcc 
ctggccgggc 
ac~aggacgc 
gcagaggcac 
cagcccctgg 
ttaaggtgac 
caaaaccaac 
ccaggggccc 
atcctacccg 
tgtaagccta 
cac 



gcgagxcccc 
gticccacgag 
czggactggg 
cacgcagccc 
tgccagtccc 
tccccticcag 
Caggtggncc 
agcctccctt 
gaaagagt.cg 
gg-ccgtigg-t 
aggcccccca 
agtcag~ggc 
ccccccccgc 
gactgcggcg 
tccaaaaaca 



tgtgccctca 
gccacccggg 
gccc~~ggca 
caggggccnc 
gaccccgcac 
cctctgcgcc 
tgtccaggct 
ctgtg-cccc 
gggctggacg 
cgacatttgc 
caggcticcca 
cggtgagggc 
cggagtcccc 
ctacgagggc 
tcgacgatac 



iao 

240 
300 
3 SO 
420 

4ao 

540 

soa 

65Q 
720 
730 
340 
900 
9S0 
1020 
1053 



<210> 23 

<211> 741 

<212> DMA 

<213> Homo sapiens 



<40Q> 23 

gtagacGGcc 

tctagcagcc 

catccrggac 

agtcaaccc: 

Ccagtaagcc 

gacaczccca 

atccc-agac 

gaacttcacg 

catttccgaa 

aataacgctc 

ttttgcattc 

aaaaaaaaaa 

tctccccccc 



ggacct: cgra 
gagcaaggct 
gttaaaaccc 
gctgagacct 
ttgaaaaacg 
ccacctcaaa 
aacagccgad 
ccaacgggaa 
a" eatccac 
CgctaCatga 
gtcccagtt" 
aaaatgaccc 
cccgcccccc 



tgtcact:gag 
caggccctag 
agccgtcccc 
aac" cacata 
cctauacaga 
agactccgcg 
g-gctgccac 
gcaccc-gca 
ac-gtcgtgn 
acatactaca 
ggggctacca 
tcgagggggg 

g 



tctaagagaa 
aatggctcag 
c-gaaamcta 
taa:aaaatg 
tccacgtaac 
ugctccctcg 
tgcacacaca 
tgtactgctc 
g~a~ccaaag 
aaatatccgt 
tgaacaaagc 
gcccggwacc 



aggtagtccc 
cccgctccat 
aacacgaaag 
aatca-ttca 
tgccaccaaa 
tagcaagtct 
caccagcttc 
tgtgtctggc 
tgtgt :cccc 
ccgcCGaccc 
tcccg-gaaa 
caaaacggag 



aggaccccgg 
cgcaatgcac 
accgagacCt 
acGgtgcagc 
accgagaaag 
cccccacccc 
gcccgtccca 
ttcctcagtg 
tccatcgcca 
accggnggac 
ai:caaaaaa 
ta-ctccctc 



SO 
120 
130 
240 
30Q 
3 SO 
.420 
43Q 
540 
60 0 

sso 

720 
741 



<210> 24 

<21I> 946 

<212> DMA 

<213> Homo sapiens 



<40Q> 24 

ggcacgagcg 

ctctactgga 

tcatccccaa 

gcccccccac 

ggtgagcgga 

ctctcgccaa 

ggagticcctg 

cggggagagg 

Gccccccacg 

Cgccc^ggra 

acgccaggca 

agcagcgagg 

tctgtagaac 

gccctggagc 



aaagcctctc 
ccggaacaca 
ggccgagaCg 
ccccizgcGgg 
ggcaggggtg 
cagccggggg 
ggggtggccg 
cctggggcct 
tctcagccac 
a tgagcagaa 
aaagctcccg 
gcaaagacgg 
ggggacgatc 

CCCttGCCCC 



tc"aacaac 
ctcatg-gcc 
c-gccggcgg 
acggacaccG 
gacggaggcc 
cgggagcctzg 
ccaaaggtg- 
tggccacatc 
cacagggtcc 
ggaaagcc" z 
ggaaaagccg 
tgaacacagc 
acacccgcct 
cgagaccggc 



gg-gccgcac 
ccgccccctg 
:cccacgaga 
ggccgt-gagc 
"cgagggccc 
ggtctcgctc 
gta-gacaag 
c^ccaagggc 
ccc^cccticc 
ggggcgngcC 
gaggagsctg 
cctccagctg 
cacaagaatg 
acccaigagt 



agcttcgccc 
acccagccaa 
gcctgcccac 
caggcGgggg 
atcactagca 
aggccagagc 
cacgtatccc 
tgctgggctit 
ttgcacccca 
caaagtcagg 
gggcggccac 
cctgagcctc 
tcgagacaat 
ctgccgggac 



tirgaaagcac 
ggctgccccc 
gggcncaggc 
tcacggccgg 
gggtcaccac 
Ctctcaaccc 
Cggacacctc 
cggagcag-c 
gaccctccgc 
agagcaaaa; 
cgggacgugg 
agtt:tccaa 
ccacagagac 
cagaaaac:; 



SO 
120 
ISO 
240 
300 
360 
420 
430 
540 
600 
660 
720 
730 
840 
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atct:cagggg cccagcgggg cacccaggag agnccggcgg cgcaagcgcu gcacaaacca 9QQ 
caagcgt^cc ccaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 945 



<21Q> 25 

<211> 831 

<212> DHA 

<213> Hamo sapiens 

<22Q> 

<221> SITE 

<222> (5) 

<223> n. equals a, eg, or c 
<220> 

<221> SITS 
<222> (10) 

<223> a equals a,C,g, or c 



<220> 

<221> SITE 
<222> (11) 

<223> a equals a,C,g, 



or c 



<220> 

-<221> SITS 

<222> (15) 

<223> n equals a,c,g, or c 
<22Q> 

<221> SITE 

<222> (27) 

<223> n equals a,t,g, or c 



<4QQ> 25 

catenae ggn 

tcccgggccg 

cCCcctggac 

ctccagaag- 

aagttccagc 

gtggtaccc- 

acacttccgc 

caactgtgga 

gcacggtgg- 

ataacaaege 

cttgggg" - 

ggcagtcca- 

tgsctaccta 

gccgctccag 



naccncCac- 
acccacgcg" 
gtgcccgccc 
tggcccagcc 
tggt:accaat 
tgtgggcatc 
cccgcagtgg 
ctcacacuta 
caacagccca 
agcgaacccg 
ctggcccagg 
gktaaaaicia 
gcagcaaaaa 
aggacccaag 



ataggunaag 
ccgggggaaw 
c-ctcggctg 
c~gagtaaaa 
ggaggageca 
gcatacttcc 
ccaactaccg 
ggggctacaa 
acaaacaakg 
cccccccctg 
gggtgaacc- 
ccstccacgg 
aaaaaaaaaa 
cccacgtacg 



ctggtacgcc 
ccccagtcga 
caccggcccg 
gttctccaag 
caguggtggg 
-cacacagci: 
cattcggtgg 
cgcagtiacac 
caaggcacat 
t-ctcazcgc 
cact tcac-t 
aaagagcccc 
aaaaaaaaaa 
cgcgcacgcg 



cgcaggtacc 
cttccccaag 
ccccgccgga 
accgacaaca 
ccc:gtctcc 
gate ~~ zgtcg 
tttagccgga 
Cgggncggai 
agcagggaaa 
cctCLtgccc 
coacaggtcg 
atgeaacage 
aaaaaaaaaa 
acacca cage 



ggcccggaat 
cagcactccg 
gaeacaeggg 
acctgggaga 
agtcuccccg 
aacgat~ag 
tcaccaggac 
gaaagcactg 
cccattc- eg 
ccaactgctg 
aaacngaacg 
acaacaaaac 
aaaaagggcg 
t 



SO 
120 

iao 

240 
300 
3S0 
420 

4ao 

540 
600 

sso 

720 
730 
831 



<21Q> 2 5 

<211> 1294 

<212> DNA 

<213> Homo saaiens 



<4Q0> 25 

ctgeagaact eggcacgagg ccaccccacc tctctcggcc ccagtt-c-c cgtgaaacca SO 
ggaatcaaca tggrcccuga gacccccttc egaeggegaa tgcgngguc- ggugattccg 120 
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tggccctgca ccaigaccnc atctiagctct: ctcc-aacag gggacg"tc actgccccgt: 130 

aaaacccccg tgggaccgcg Cgtcrccaca ggagccaggg "gcaaatgaa cagaatccag 240 

attggtccta acacactcta cccccaaaag aaagggcacg gggaggccat gacctcaaag 300 

caggutccct: ctgncgccug cgaagcccg- gacgatcgag aguggc^ggg actggcggga 3 60 

cgatgtctgg guggaagagg gaggccaccc cgatgcgccc cgtcccgggg aggcacccag 420 

cccgtaagga ggngacgtcc acctacaccg agcgcaagga ccc-gaaccg ggggaggctg 430 

aggcggggcc tcizcgaiitcc caccccgccc cccagtggct aggctiagtgt: ggcccgggaa 540 

atgacttcca tccctccctic caggcacacc taataagagg ccagcat"c caga~"C"gc 600 

cgcctccgga cgaacgcccc agagagccgg gaggcgcccc ggaggaegga accc~zacz.z 65Q 

gagcgtzgcc gaggcg-gtc gggggtgccg tggcacaggc cccctcccct; ggggggcacc 72G 

actgtccccr cgccctgcac ccccgccgcc ccccctgccc ctgaacaggc g-cggagacgt 730 

gcacgggaca ctcggaggcc ggacgctcaa cagag-ggag cgccgcgacg gngcggccgc 840 

agcccggcrg tgcccccacg acgcagctgc aaccagagga gccgtgggac gccgccccac 900 

gtggacacag cccactcacc gggrgccgcc cccg"gctgg gcgctgcacc c^tactgtcg 9 SO 

ttaaaaacct a-antaagaC gcggccgggc acggrggccc a-gc-cegtaa tcccagcacc 1020 

ttgggaggcc gagacgggcg gaccacgagg tcaggagatc gagaccaccc tggcctacac 10 30 

ggtigaaaccc cg-ctctiact aaaaatacaa aaaaaccagc cggg-g-agn ggtgag^gcc 1140 

tgtagnccca gc^aczcggg aggccgaggc aggagaacgg cgcgaacccg ggaggcggag 1200 

cttgcagcga gccgagaccg cgccac-gcc ccccagcctg ggcgac-gag cgaaactccg 1260 

tcccaaaaaa aaaaaaaaaa aaaaaaaacc cgag 1294 



<210> 27 

<211> 1556 

<212> DNA 

<213> Homo sapiens 



<400> 27 

ggcacgaggt ccacagcacc tgacctgcaa ggcagcta^a caag-tcctg gacic"gta 6 0 

gctccggagr: gtttcacccg accttaagcc cacccca:cc acccitaacc aagaaaccat 12 0 

gtgctccccc gcacgcctgt g-ccccccccC cacgtgtc-g ccgtctgcg- ggaagcccgg 130 

cctggcgcac gccgtggcgc accgcacgct ggaacccgng gagt:ctgcac gcgcggtaca 240 

gtacgaggcg ggccacgutc cgtagtgcgc gccgegggct cccgagaaac aagceaaagc 300 

gtgtgctgaa acagatccta tcgacacaaa acaagcccta ttgcnaaact taagagaacg 260 

Cgitacaaac g--ctctgct aaacaccagt accgatcatc ctacatga-g tacctattga 420 

cataacaacc tgaaaccccu ga'tccagac aacc^ctcct caagncgacc cagc-gcatg 43 0 

actctcagaa atcag-ca:c tcccatcgta gaeegccggt ttuctccctc eagctcgcac 540 

cgngtatc;- cctcccgtgg agaaaacgtg gtcggcaaga aa:gccatat tccaaagctg SQQ 

tatcglggc- gctaacgcag aaaacaccag tgtac-gcag gccgnctggc agnggggc-g 560 

gggctgagng tcc-gccctc agtggcccgt gcccg-gctc ccgtccgccg aca^gcaga^ 720 

acaggggcag aec~gagggt tcgacggagt gcagaaggcc acacgcgcgg czzzczqzaa. 73 0 

aEgcagaaac a:ggaacccc tgagcagaca cttgcctccc ggagcacct- gcacgga;cc 34Q 

cgccccc-ga tgcttcattg ccg-Caaeag agtggtgg-g gtngtg- tat gagaaattct: 90Q 

gtctaacccg gctcctgaaa ttictcaaac taaacatcca Cgccgttccg cgcctcecte 960 

aatgactgag gccagrcgaca ccacccagaa aagtaacagc aacctggg-c ccctgagcgt 1020 

caggaCgccc accatc'aac c-gtccc-cg ccagngtcta gtacgncggc ccccggcagt 1080 

gcaggtgtigt: gtcc-gtgtc ctttcccg-g cgtcccigca ctagtggcag cctctgctcc 1140 

caggccag— tagagtagac tggctccggt accgccagca agtagrz-gct: gctacccag-c 1200 

gtagccatga agcccagctc cttggacc~c gacacacaig ctccaggcaa agtacgtaac 1260 

ccagacgctt ccaaccccc: ccagacgatc gcaaccg-tc ^ccatgccgt. ccg--aggcg 1320 

tcatgtcaat tcccgatcca acag-gngcc cgtaacata: acggtagcga agagacacca 1330 

cacgcagaga ccgttccccc C-caCcaacc acaggtccgc tatcgacgag agcacc-ccc 1440 



Cgctaggcag tcaccccact tcccgctg-C ggaggatccg aigagagacg gcagcgacgg 1500 

cgctgccctc ccagctg-ga t-cactatca tcgcccagag cagatgaaao tggacggcga 1560 

gtggagaacg ct;cccgaaa cagatccgaa aaggctcaaa ggaaaacca; agzg~aca!:c 1620 

gatccacaca cataccaaaa aaaaaaaaaa aaaaaa 18ca 



<210> 23 
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<211> 1350 

<2L2> DMA 

<213> Komo sapiens 



<4QQ> 23 

gtgactccca tactcaaeag acccccgtaa acgtCaaaac acctacaCzt aaatrgttaaa SO 

acactaaata tagagagggg ctctacccca atcacagagc aacaacaaaa ataacgctta 120 

tagc-aaact gcccgtccca gaaagcaccc gctctcccan gccacccc-a aaccctcttg 130 

tcatactt" gTicaCcgaac aatgccctcc ctctcgcctc cca.zccCCB.t ccagaacctc 240 

tagaagacca caatcgtigga gacacactac ccagtactgt: ccgacacacc ttcaccngac 300 

aaacactcag tgcaggaaac Cg-gatltgc tatacgctta cgcacacaat: ecractctgt 3 SO 

agtcaecaga atgntaacg" aaggtacatc tgatttttac CCCtcaca~g cg~ag~Cctc 420 

CCtctccaca gtcaaagcac tcaeac-aCc gggggtgggg gcagggaacc aagciiggcgg 430 

gctcgaaaat: ccatccatat gt:acc~gtct: acaaacgtcc ggggacaacc taaactcgaa 54Q 

acctaagcta cacacag-cc ggcaaigctc tcccccaaca cccacaataa taggacgacc SOQ 

tacaagaaaa caag~ccccc tticgcaaatC tctaccaeac Caaag~tgt:c ccctcaatct SSQ 

agcataccta aaacaggatt tagt"cagr:t tagctcacac aggtg-itgc tgacattcac 720 

tggcca"ca atacag^gtc gagtggctcc ccgcaaaagt aCaagcgcca acac-acgaa 730 

gaaaigcaca cgaccacccc cgcccacccc tacaacaaac ttiacacaaaa tggactcaaa 340 

aatiicctacc cacagcccaa aacucccgga gz.zz3.cta.cc cccctzccaa actcacag-a 900 

agaccacctg Cgtattttac acc:tag;aa agccaatcac gaagtacaag tatcatacac; 960 

g-actcccga gctactatta cntgaaaaaa acctgccaaa cagcacctitc agga-acacc 1020 

tacactttca ctoacccaaa aagcacacaa aaa^aaaaag -ggaaaaagg tatc~cctga 103Q 

a^gtccaaga gcatccnaca gtgccaaaca acaaagcacc a-"tz~zct. tcacaaccag 1140 

gatCaaaac; catataiiacc gcagggcaga cacacacacg atiagccngcg c-gactaacc 1200 

taacccca:; tgcaaacaga tgaaaactct actccctiiac c-cacctaca aga^ggctca 12SQ 

atgtattzggg aggcctctcc ttcactacag aaagtg^ara c~gg~acaca acaaatgaac 1320 

ttttcaaatg aaaaaaaaaa aaaaaaaaaa 13 50 



<21Q> 29 

<211> 17SS 

<212> DMA 

<213> Komo sapiens 

<220> 

<22i> SITS 
<222> C1743) 

<223> a equals a, eg, or c 
<22Q> 

<221> SITS 
<222> (1743) 

<223> n equals a,t,g, or c 
<220> 

<221> S~TZ 
<222> (1749) 

<223> a equals a, eg, or c 



<40G> 29 

gcicagcgcg c-gcccggct ggggacccgc gcacccgcag cgcccgctgc Ccggccctgc SO 

atcctgcccg ggcaccctgc gcccggccac gacggcgcac tcacccgccc tcccggccac 120 

C3.tcttC3.cc acg--ctggg gcctcgtcgg caccgcgggc cceggctcgc gecgaaggga 130 

cccaaccgcg gag~gacca: caccacgccg gucgccaccg ccgcccgcug tCaccccuCC 240 

tggctcatcg ccancctggc gcagccgaac cccctgtccg ggccccagcc gaagaacgag 300 

accatccgg- aeg-gegct.;: cccgtgggag tgacccgccg cccccqzccc aggcgcccag 3SQ 

ccctcggaac gaccg-ggcc ccaccgtccc tgacaacccc ttcgcccgga ccctccccca 420 

cacaactacg cctggtcacc agcccccccc cgc-ggcacc cagagacccg gacccgcagg 430 



17 



cctgcctggt ccctggaagt: czzccczqzc ttcccagcca gcccgggccc Cggggagccc 540 

tgggcacagc agcggccgag gggacgrcct gccccaacac ccgcactgc^ cxggagcctg 600 

ccctctttcc caaggagatg ccgccgggga gctgg-atgg g-ggggtccc cccctctaca 660 

gacgcggcag acgccaggac tcagcccacc cngaggagga cacg-gtcct: cacggagagg 720 

gtgctccggc ccaggcgggg gagncggtgc ccagncagca gctccgccac catcctgctg 730 

ggaaccgggg gggcccctat CgggctaCag gcaaggccct ccctctggca cggaaccgct S4Q 

aactntccga cacgtctaga cg-gaaatcc ccgaaaacgc cgaagcagag aaaca:i:cac 900 

acacaaaaag caacatagtc a_gtgggt:c:c agacggccic agtcctagasi gttggcaccc 9SQ 

CtCgccg^gTi ccccccagag caccctgrcc cgcctcocgc caccccgacc cccctcagng 10 20 

gacgtct^cc czcccccqzc ccca.qcct.qz ca.qzccqa.qc acagtgcagg tctggccc-g 1080 

acttgggccc ccggccgcag cgggggcgga tctcagagcc tctcacggca gcacc-aag- 1140 

gaccagagct: gggacgagag aggggaaggg gcaatgngag tggcgctiacg ggacgggcca 12Q0 

gccccgctcc cgagccagcc ccqccczczq cccccZqqcc ctgggccctg tgc-agggat 12 60 

ggtgaagaac gggggcg-gc carcctgcag gagcgggaag caacacgcag ggg-cccgga 13 20 

cccctcmagc cttgccctya cgctitiacccg agctcccag- gtgg-c-agca cagagcccac 13 80 

ccacctcgcc Cggcccccag ctggggcc-g cccccactgg cgccccaggg gaagaaacga 1440 

cagccccac- tctgcacgga cxgccgacgt ggcc-gccac cczqr.zca.qc gggcatcgcc 1500 

. ttzuggagcag caggagacxa ggacgcccct cactcacacg cca.qzzcczq gccggcoagc 1550 

tgctcagggc tcaggctggg gccccccacc gaca:cctcc ccctacacuc cctccccgag 1620 

cccccgtcgc ccctcccg-t gggt.aaggg~ gt:t:gagtgtg actmgcgccg aaaacccgg- 1630 

tcacacaaaa taaacaacgg tgatgaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1740 

aanaaaarma aaaaaaaaaa aaaaaa 1766 



<210> 30 

<211> 2790 

<212> DNA 

<213> Homo sapiens 

<400> 3Q 



ccgtcttggg cnactggcag cZcczctccz ZqqqcZcczq gccgccaggc gttgg-gcca ■ 60 

cttcttaaag gcctggaacc agggaggaga ggaaacgcca ttgctgtggg cZZZczccqq 120 

gg-c"gcgcc g-gcccgc-a gagcaacccc cgtacccagc tcctt-tgtc cccagggccc 130 

ctccctczqc cccaagcagc cagccagiac- tgcctiaggcc aaatgcacaa gcccagaaca 2 40 

gatctgacgg tgagotggga agctgcaccc agagcagagc aaacgaggga gggggcgccc 300 

aggacccagg ccccccaCgg gctaacgtga gtggcagcca tgccccacgc cacacctcct 3 60 

tcgcaaaccg atggaccggg tgggcccggc ccgagctggg gccacaaa:c aaaccaaggg 420 

ctccagcacc cagcctgtgt gccczgtaac ggaaccgacc cccccccctg aaaacgaaag 430 

ggccccgggg ctggcaatca gggaaagc^c cacggtgcgc ggctgnggca caaacctctg 540 

gaaggciiggc tgaccggaavi gcagggaaaa cggcagtacc- tgggaaagga cccacccaic 6QQ 

tccccgctagc tgcaaccgcc gagccactcg cagccccagg accogccgcc accacgucng 660 

ccaggcccat ctcaggsgcc actccctgag cttcggggac agccggcaga gaaggcccct 720 

tgtgctcacg ctcccccgca g-ccccagcc czzczqcczz tctcccccga caccgctgca 730 

ccagagcgaa agggccaegg caagggggtg tcatccgagg agcatzaaga atgcagaccc 340 

ctgggcccgn cccccaaggc tccggagtca g-agg-coaa gggccatacc ttiagagaggg 300 

gticcgggtea agcacgaggt gaaacgggag acggzcag-g cggagagggg tqcz.ccca.cz 9 60 

caccagggic cgcaccagcc qcZczqcccc ccgggcaccc acccagcgcc gccaigccac 1020 

tgccaggcac ctggcc^gcc gggaaccccg cagcccgtiga agcagrgccc cgaggcaccg 1030 

gcgccgcagg eactcccccc tgacggccaa gagcaccg-g acccztcagg gccagaagga 1140 

gggcagagcc aigggcotigg gcctgc-tct ccaggaCcct gcaggaacga gcaccggcca 12Q0 

gagagggccc agccg-agcc acggctcagg caagcccccc agccctcgcc cccacccctc 12 60 

ggacccacca aaczgcacac acagctcctic tcaccg-agc czccqzzzs.z gggcctcgcx 13 20 

ctgggcctcg caggcnccgg gctcagggct: ggagngcgc- cztqqzcccz qqzcccZcqt 13 80 

ccacaggggc aggcccggga cccagccacc ctgtccaggc cactgtggcc agagccggaa 1440 

ggcagggcag a.ggga.aCg-- ccctgcaccc tggaaagggg agttgag-ca caagaggcca IS 0 0 

aggcgggccc aggaaggcag ctgcccttag tgcccgccca ggagttgag- acagtigagga 1550 

gggtggagga aggtgctgag cncagcctcg tgoccngcoc ccatctcccc aggccticcag 1620 

cctctcccgg ctgcctgcca cccaaagaga aaccacaggg gcggggcagg aatgcaaagt 1680 

qzzztczca.q aacagctgaa acactccgaa gagggaacgg acggggagaa cggccaacac 1740 
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acaiaagacc g-g-cccaag gagc-gacct ccaggcccc- gaggactgga gaccgc-cca 13 00 

cccctgcacc ccagacaccg cctgzccccc ggggcaagg- ccccccaccc tgagcccagg 1360 

ccgtccccct cggccccccc cgtccaccca ggcugcaccg cagcgacggc gcgggaggca 1920 

ccagctctgt ggcctgcgtc cagcagctgc gggtctgaag gaatagccag agaggagcac 19 30 

ctgaacccca tgggcctgga ciiccctgggg ccccgcrggg acttccccgc tgccccagcc 2040 

ggcaggacac atcccggcct cttcccccac ccattccccc acgcggccga agacatticca 2100 

acaa&ggggt gggcccacaa tagttagccc Ccag-cagcc cccggagcac agccctggga 2150 

gggggctact tctcncccca czgaaaacat tccaaagccg agccaccigt. ccgaggcccc 2220 

acccctcgga agccgtccga ctccagag-c Cgagcccccg gctagtaccc taiagagagg 2230 

ggqczczcca. aaggggccgc tggggcacgt grgcctg^gg cagaaaagag gagaccccgg 23 40 

aattcagcac cctgggcgcc aCccccagcg ttcagtutct: agaggcc~ca gc~tc"ccac 2400 

cagct-atgg gaccc-tgtc tttacngaca agaatggaac agaaacgtaa aagcac-ctg 2450 

aaaagcaatt gccc^gtaac tcatctiagaa agaaaagacc ccgagacccc agaacc-gct 2520 

gt-gccaCag ccccacacgt gtgaancctg caactagcca aggc"agctc c-ctcaattc 2530 

cacctaaaaa acaaaaacca gcaggtgtgg tggctcacgg cg-aatgggc ccgcccaacg 2S4Q 

ccttgggagg ccaaggcagg tagaccgctc gagcccagga gtngagaca agccccggca 2700 

acaCagcgag accccactcc tacaaaaaaa aaaaaaaaag gaacccgada tcaagcticac 27 SO 

cga'caccgcc gaccccgagg gggggcccgg 27 90 



<21Q> 31 

<2L1> 1417 

<212> DMA 

<213> Homo sapiens 



<40Q> 31 

ttttcctttg atcaaaaaaa cccaaaaaat cacaaaatga tgtcccatat gagttcaaca 60 

caigacgccg gaggagcata gagatagacc tagactaggc acgtgnacgt gtg-gtg-cgc 120 

atgtgtgcac gcacgcatgc ttacgcatgc gcg-gcgcac gcatgctcgt gtg-gtg-gt 130 

g-gcg-gcgt gtagagcctc ggecaccccg acagagcaaa gacacaggag ggtggcacac 2 40 

ggaagaacaa gtgaccccac cctcccctgc acagccaaaa tctggccaag tgagagggga 3 00 

gatgggagag gggagagggg agaaaggaga agaggcaccg accggagggg ccgaagc;ct 3 60 
gtccctcctg ggcaggcgnt c.ccatccac acccccctcc t^ggatagag aggataagca 420 
ggccaaagac gcacgaaacc tgagccccac tgtagcccca gacttctaga aaagtcaaca 430 
gcccczgcat ctctagccga tcctccg~cg ttcaacgnc: gca^caccgc accgggagac 540 
acctgacaga ctgggcctgc cgcaggccac agcagacat- gggcagcccc agaacgaagc 600 
cgaccgtcct tggaatgtgc cacaggggtg Cgacgccccg gccaactcca gcgccgccta 550 
aaacggcccc itgcaacatt cccczcnc-c catcctaaat cagggacttg aagccacaaa 720 
atggcaaaca cacagttccg gcagtcgcte cgagcaicgg agaaatcgcc ccggccatct 78 0 

gttutgtccc cagcatgttc ctcacggaat atccacggat atatccatgg acataacaga 340 
catcctgcca aggcagagc- cggccctcga gaac-cggca agcccagcgc ccgcctggac 90 0 

tcctgcceca tgncccaccc agcgtctgga gaaaagctct gagagaaaga cgaatgcctg 960 

aggccacaca gcccagaag" agtcaagagc acaggcccca gaactagccc cacgtgggcc 1020 

gaaaccccag caccagcgcc cgccggctgc gcgaigtagg agagcccctt accagctctg 10 SO 

tgcctcaczc gtctcac-ccg taaaaCgaga acaagaatcg gccgggcccg gtggctcacg 1140 

cttgtaat:cc cagcacttcg ggaggccgag gngggcggat cac^tgaagc caggagccca 12QQ 

agaccagtcc ggccaacgcg guggaaaccc cgtccctgcc aaaaacacaa aaactagcca 125Q 

ggcgcggzgg cgggcacccg cagtcccagc tacccaaaag gctgaagcag gagaaccgcc 13 20 

Cgaacctggg aggtggaggc tgtcagtgag ccaagaccac accactgcac cgcagcccgg 1330 

gtgacagagc aagactccgt ctcaaaaaaa aaaaagg 1417 



<210> 32 

<211> 1906 

<212> DNA 

<213> Homo sapiens 

<22Q> 

<221> SITS 
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<222> (617) 

<223> n equals a,c,g, or c 

<220> 
<221> SITS 
<222> (940) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1451) 

<223> a equals a,C,g, or c 
<220> 

<221> SITE 
<222> (1901) 

<223> n equals a,C,g, or c 
<4QQ> 32 



tctrggagctc caccgcggtg gcggccgctc tagaactagc gga:cccccg ggcigcagga SO 

attcggcacg agcccagc-t ctggcgg-c-g ctggcaacct ccggngcicc tiggc-tg^a 120 

gatgcaccae cccaacctcc gccttcatcc tcacgccgca tcctccctg- g-cgtgcacgg 130 

ctgtctctgc gtccaadCCt: tccccctcca taaggacacc articadaccg gaCtagagcc 240 

caccccaacg acctcatcct aacctgacaa tctgaaagca ccctac;tcc aaataggg^g 300 

acactcarag ggcgtCagga cctcaatacc ticccgagggg acacagctca acccataaca 3 SO 

cctaccaacg g-ttctggaa tca-cuctca aaacaaacza tccaaactct aatcttcgwt 420 

tcagggtcag ctcctagcag aactcaaCgn aagacaccct; tcnaaagacg gcaccccaga 43 0 

ggtacagaca gattg^gaac cctccccagc acag-g^aac aagccccttg caaaaacccc 54Q 

gaeccgagcc aacactgcaa tzcczzgczc aaatctaaga acatgcctyc cagccccccc S0Q 

caagactatc tgggggnagc tgkitictagg gcggcccaan ttaticg-ct cgaacgcuga S60 

tggccgtgca ccacaccgcg gctcaacggg tgntgaggac acgcicacac gcaacaaatg 720 

cacagggcCC tgccaccagg Cggcactagg ggaaggacac ttycagcata c~gcagaggc 730 

attctcttcc agcc-ggteg cccg-caagc acctgctgag atgaccggcc cagacggaga 340 

ctgcggggtt gctgagccag ccctgtcccc tcctcacccc cggaagatgg aataaatgcg 900 

tgtcagaagg gagggngccc cccatctcgg grgcaccggn gtcgcccctc agaaaaacac 9SQ 

Cgcctgcaca ccctgeagcc ctgaaacgaa tccgagtggc aactcaagcg ggcagagctc 1020 

gncttggac^ ttagacagtc ccaccugcgt gcccccctcc :ctgctccag ctccgacatc 1030 

tcggciccac acacagtiggc c-gaageggc acacggaccc tgagaagagg agaggcaaag 1140 

gwrgcagccg tggccggccc ctcctgccyc cnggccrcc- tggctggtga gggaagaaca 12QQ 

aacacagtcg tgtctccagg Ccacagctgc cagggctcca cctgtggggt ggg-ggc~cc 12S0 

agtsccgctc tgagx:caaga aatgctgcaa acacgtcctc cctraaagag mcctiaggaa- 1320 

tgccacctct tcccgcagct ttctgtagcc aataagcaCt tgaggaacCg ragaagggcc 13 30 

cagccctgaa ttgcaaggga aaactgtgtg agtg-gctcc agt-aagaaa aaagttaatc 1440 

ctagtgagac ccgctcgcct acaaaacaga tgrataggcc agacagatg- acagggacga 1500 

ccccgactcc ccttcg-ccai cgcaggaagg Cgggg-atgt acgacccctg g-caagacca 15S0 

ataggaggcc gggcgcagtg gcntacgccc gtggtcccag cactctggga ggccggggcg 1S2Q 

gg-ggaccgc ccgagg^cgg gagttcgaga tcagcctggc caaca-ggag aaaccccgtc IS 3 0 

tccactgaaa aaaaaaaaca cagaaccggc cggg-gtggt: ggcatgcctg cggtcccggc 1740 

Cgctcgggag gccggggcag gagaatggc" tgaacccggg aggcggaggt Cgcggtgagc 130 0 



cgaggtcgcg ccaiiigcacC ccagcccggg caacaggcgc ggaactcggt caaaaaaaaa 
aaaaaaaaaa raactcgggg ggggmaccga acccggg-cg cacatc 



<21Q> 33 

<211> 543 

<212> DNA 

<213> Homo sapiens 

< 2 2 0 > 



20 



<221> SITS 
<222> (3S7) 

<223> a equals a,t,g, or c 
<220> 

<221> SITE 
<222> (37S) 

<223> a equals a,t,g, or c 



<4Q0> 33 

ggcacgagaa aatattgacc cct:at:ctiggc cutcaccaac cgacctcgaa aagcccca-g 60 
agatgccctt cctztaacgtg acatcgtcca gcctcaccgC ttttacctca a&tccaaccg 
cccacacact tgaccgrgca gtcaggag-g accggccccc ccrt:gt:cctc acccaCgcac 
gtucggagga gctgattcct: gaacccacac tcaacc-cta ctgccaggga aaCgctacac 
casttctcta actggaagta caattagagn gacg-tggta ggg-agaaaa agagggagcc 
acttgacgcc ttcagg-caa tcagagctat gggcgccaca ggcccgcc^= tcraag-gac 



430 
540 
543 



120 
130 
240 
300 
360 

aeattcacac cuaatactca gaccaggttc tgaaagcctt ggggg^c" C t-agartitta 42 0 

acccccacct ictucacggt: acaaa-atgc acaaaagaaa aagg-ccnac accctcccac 
acaaacttac aaacaaaCCC tgaactcccc ccjctcaaaa aaaaaaaaaa aaaaaaaaaa 
aaa 



<21Q> 34 

<211> 1452 

<212> DMA. 

<213> Homo sapiens 

<220> 

<22L> SITE 
<222> (233) 

<223> n equals a,C,g, or c 
<22Q> 

<221> SITE 
<222> (596) 

<223> n equals a,C,g, or c 
<22Q> 

<221> SITE 
<222> ( SO 7 ) 

<223> a equals a,C,g, or c 
<220> 

<221> SITE 
<222> (1275) 

<223> a equals a, eg, or c 
<220> 

<221> SITE 
<222> (1234) 

<2 23> a equals a,t,g, or c 



<4QG> 34 

gaatteggea cgagatcaag ccg-gcact: taaccgggtg aactgtacaC gtragtcata 
cacctractg cagtegtwac taaaaaacaa caggaggeca -gtgggccgc taggagragc 
aatgcc^gcv cccagccacc aggcagagac cagggcagga cagagkagaa ngggctg^gc 
Cgcagac:at ctgtggtacc caac^gatgc acaaaacagg gcgcga-ctc ccgcaccgct 240 
aegcazgagt. ggattcccag CaaaiCgcgc caggccgcc- gancgaegtg cggcCtgcgc 3 GO 

ttcggatcgn aatgctcacc tatgecacte aagczacata ccctgtggcc cctgtggcca 3 SO 



£0 
120 
130 
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ggactg~ggg 

aggaaaaaag 

acgcacaccc 

acagaaaccg 

ttggganacc 

tgacntggcc 

tcctcsaagg 

g-ctctggcc" 

tgtgggtgca 

gagaagtccg 

agtcccag^t: 

ctttcgtgac 

gaaggaaag" 

gacgaacttg 

acctgagccc 

acaacaacaa 

gctcccgTiaa 

tcaaggccgc 

ctgtitccaaa 



ctactacccg 
ca-ctctgag 
agg~ggtgag 
cccagagaag 
Cgrgactgg-c 
gctcatcttg 
ccaggcagat 
gaacgccgca 
ccgcggggag 
gcc-cacgcg 
cgcactggga 
gat-ccucct; 
gtcccccngn 
cttgaaggca 
aggagtccga 
caacnacaac 
tcccagctac 
agngagccad 
aa 



kagugactcg 
cacagggcgg 
ccccaaggat 
aacgaacgga 
cctgctacca 
gcacaccccc 
gcagcagctg 
tccgtcg-gc 
gagng~cacg 
ggatccgggc 
agaaccaaaa 
gagaccccct: 
tccccactgc 
gtctg~aacc 
gaccagccng 
aacnacaaca 
tugggaggcc 
gatcgcacca 



tcaggggaaa 
acgagctgga 
aaggcggaat 
ggacacaggg 
cgccaactta 
gctccacaca 
tcgggccagc 
gacagccaca 
ataagaacat 
g-tgccGCGa 
aagrczgcca 
cctaccagcc 
tcaccctggg 
ccatcaccvii 
agcaacgtga 
acaaccacaa 
gagacgggag 
ccgcattcca 



ggacccacag 
tgagctgccg 
ttgccccaca 
ctccg-ggtic 
gccccaaacc 
gccacataca 
aaagaggaar 
aczgcccagg 
tggcc ctcag 
ggaaggctct 
gcctcutcag 
caggagtggg 
gcattcacga 
gggagccaaa 
caaaaccccg 
caaactgggc 
gatcgcctga 
gcccgggcga 



cc~gcgcagg 
ggcaagagcc 
gctgrcctzgg 
ccaccncccc 
cacctctgac 
tacrggctgc 
gccccggaag 
ctmccttgTiG 
CGtGccnggg 
cngcacgticc 
CGtg-ccGcgC 
actc-Gtgga 
acaigcgccc 
gargcggatc 
CctiGcaccaa 
tggatggcac 
gcccaggagg 
cagagggaga 



420 
430 
54Q 
500 
650 
720 
730 

a40 

900 
950 
102Q 
1030 
1140 
12QQ 
1250 
13 20 
1330 
1440 
1452 



<21Q> 35 

<211> 2S0S 

<212> DMA 

<213> Homo sapisr.s 



<22Q> 

<221> SITE 
<222> (1653) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1555) 

<223> c equals a,£,g, or c 



<220> 

<221> SITE 
<222> (285Q) 

<223> n equals a,t,g, or c 



<4Q0> 35 

gagGCccaga 

ctttgggtct 

gctntgagtz 

tcgtgtctcc 

tttaataacc 

agaacggtga 

taacacggtc 

gagttcgaa^ 

gttggccgat 

caaccgccgg 

atatcacccc 

tctggtgcca 

caccactcgG 

ttgcaggctig 

GttcaJig" Ct 

caaaagcctg 

taagtttttg 



czgatcgcag 
ucaaaacg" - 
gtatacagaa 
aacittacac 
taactataca 
ccttgcngac 
acactcctcg 
aatgtgttgt 
eg-ccatggt 
caaaacaccg 
CGaaaaggga 
actaaaaacc 
aaaaacccgc 
accGcaaccC 
caaccaagaa 
aaacctaacc 
tccaccccct: 



Caagtictcgc 
gttt ~ ttaaa 
acaaatgticg 
catgaggtga 
gaccaaagcg 
Caaticagac 
Catgacaaag 
atccagaaac 
tgagcacgac 
ggcgcactat 
CctcGtcagg 
ccacacgggc 
tcaaaactcc 
aagccccgat 
ggaaaactca 
tcgagrctaa 
caagtaaaaa 



gacti^aacaa 
atcg"cc"LC 
ggaattgctc 
agcaccaggg 
ccctacgtitc 
ccggcaacng 
cac -caaCcc 
acagccctac 
aaaaacacct 
"CC"c~ggac 
tcaaaaacca 
tcacaaccga 
aaaaaaatac 
aacatctggc 
cagacactcc 
atttiuczgac 
aaacatctaa 



tgaaagaaag 
ggg-gaaC" C 
cucagaaacg 
aaagagattc 
cdgcccta-g 
aaa~aatgtg 
cagaacagtg 
ccgtzcc-aaa 
cgccgaaagg 
aaaattcaca 
cgcczacgc" 
gg-Gaccaat 
tgacttgccc 
aag'cagcac 
tattagaCiG 
accggccccu 
ccac-agtta 



Cagaaagatg 
acCGagtgc- 
CgcccGCaag 
ctcogacucg 
c'ad-gtccc 
cccagaacaa 
ctggaacrGZ 
caaacaaccc 
caagct-agg 
gtiCauGCtiGu 
gaag" cccca 
aaCccacctC 
tttagcgac- 
agccccg-ga 
tacaaacccc 
tctaacactg 
gcGcncaac" 



60 
120 
13 0 
240 
300 
36Q 
42 0 
43Q 
540 
500 
660 
720 
730 
340 
90Q 
96Q 
1020 



22 



gggaaac~ca ggtaatgaac tgcugacctc 
Cagagggacia tccacccaac cacticcccgc 
aactatcata gc^ccatccc cccaaccticc 
tataaaagca ccacatg-gc ccccaacccc 
tcttctcgcc ag-cncaatc gttaagcaca 
tgaaagagag cacataaata taccccagtg 
actattgcac ttcctcattia tgrgccgggt: 
tttaatataa agacccttga cacagngaaa 
cccaccccc; cccctgccac tgracccccc 
tcgcaecgca cgcagcaggt: actgcttctc 
aatactctgc ccagatgcag ccattcaccc 
tcttagacaa aaatctucc" egtaC-tacc 
acgctacccc aggacccccg acaccaaaaa 
aacaotcccc ccaagaaggg -cg-gctgut 
ctctaag-ag ccccctttac ctcnaacc-a 
tgacacaccc aagcagcaaa gaCtcccaac 
gtag"gaaca gacaaagcca anccgaatac 
agat.tcaacc aacactaccc tatgaatccc 
tttcatcaag cccttagcaa ctcaggacag 
aaattagcta aacagctcgg tccctagcag 
tctutccctc ccacccagaa acactgcaag 
atgactti^ac aagatccagg cctcagtgac 
aaactaagca tccactca"^ ccatctcccc 
taagtcca" tttgaaaggt ggaggaggat: 
gaacaccttg aaaggggggc atacaaaccc 
aacagttccc atgcacggct cctaacg^ga 
ctctccaccc cccacccca" Cccccccctc 
cacgac--ac gtgctcacca agctccctct 
gtcaaaagct ggcaaaacct ccaaaactgt 
agtgcaaact cacga~aaaa cgcctcaa"g 
aaaacaaag- cactttcata acCtaaaaCn 
aaaaaaaaaa aaaaaaaaaa aaaaaaaa 



tctaaagctc tuiaactgat caa:cccg-a 
ac-ccacaag tgcccticccti aaaaaggaai 
acaggactca tgagagaccg c-tg.g^aaa 
cacggccgcc cgaagctcac gacgaaaaag 
gaacaaacaa cngnctggtg acggcctggc 
gaactcacca aagaagacca cacctcagaa 
c-gautcgcn. ttcgtec-ta atgcagcccc 
tcticccaccc caagcg-aag c:ac:cccca 
aCcngcccca aggagcccca acaatttaca 
cagaaagacc tggaaaaca- acccgccacg 
ggncntcaag gctgccttcc gtggagagga 
CgggUEaag- gaagtccaaa cccntacagt: 
agaaaaaaca aaaccttcac acctccta:; 
a.zzza.z.ztzc ca"ataaga aaagti-cacc 
accccacacc aaaCacccga tcaacagaaa 
ccaccattac gaaaagtgtc agcanactta 
aattccaccc cgtttccaga gaccaaacca 
gaacgtgaca acg-gac-ca aacactcaaa 
caccccatac taacctcaag tcagcaaaac 
actgc~caaa agatcaagaa aac"t"~cca 
aaaccccgga cagtctccac cagacctgcc 
atgcccccga aagcczcctg ccagccaccc 
aaagccactc accga^aaag gtagcacccc 
czccccczgc ccaaaggaac tcccctatca 
ggaaaac"a accccctggc cgngcccgac 
ggcaaacc^g cc^ctcccct cagaacacc: 
caatgaacai: t-ctattgga gcccaaactc 
tctccccctig uttaccctg^ ccgtatgcat 
caagaaaatg cccgaaagnc gatccgccac 
Ccatccgga" atgcagnaca cagaaacaga 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 



<210> IS 

<211> 953 

<212> DNA 

<2I3> Homo sapians 

<40.0> 3S 

aatccggcac gagaaccaag gcacttccag 
gcg-cgctgg acgagatcag cacctgccac 
cticgttiaagc tcttcgagac tgagcactcc 
aagaagacac tgctggacag gacggtitcac 
gtcagccaca tccgaaagtg tctggagaag 
tccgtcac-g aggcgccgga cgtcattgcc 
tccctcccca tcccggagaa tgacagcacc 
gacccng-ga cggagtccae agctcaccgc 
agagcaccc- ccagagcCga agcagaacac 
agaagctgnc ccaagaggct cgggcagcgz 
gatgacctc- ttzacaaagga aaa'ggcagc 
qzccczzczc tgccgtcaac ccaacactgc 
ctcaaggcag gcgccgggcc cccacgaccc 
ccgccgggaa ccccac-cgg ggaact:cccc 
acccag-stag ctgacgttte acigtaactg 
aaacccccaa aaagcccccc tgczgggccg 



ccgcagactg accacacccc tgtccacccg 

cagc^cccgc acccccaggt cctgcagccg 

cagctggacg tgacggagca gctdgagiizg 

ccgctgagnc gaggccatgt ac"t:cc~g-~ 

ccggacactg acaciztcacc cattcgccac 

cc-ccccata cctctgacct cgtgcaact" 

gcagg-acca ccaaaacgga aggcgagcat 

aaacccaact tcadcatggt gaaccaaccc 
Cccagaaccc g-tgrggaaa zaccczzzca. 

ctcgaaaacg ggcaccgccg ggaggagg-g 

agcttcagtg agaaactgcc cttacaaaca 

ccccaaaccc Cgt-ztcagt gtccactccc 

cccaggacag atctggccgc cagccgcggg 

tccaaagc~g acc'cag-~t tcccacaaga 

CccCaattcg ccaagaacaa ggtaaCaagt: 

gaccaaaaaa aaaaaaaaaa aaa 



1Q80 
1140 
12Q0 
12S0 
13 20 

I3aa 

1440 
1500 
15S0 
1S20 
1S80 
1740 
13QQ 
I860 
1920 
1930 
2 040 
21G0 
2150 
2220 
2230 
2340 
2400 
24S0 
2520 
2530 
2540 
27QQ 
27SQ 
2320 
2330 

2soa 



SO 
120 
130 
240 
300 
3 SO 
420 
430 
540 

soo 

6S0 
720 

7ao 

340 
900 
953 



<21Q> 37 



71 



<211> 3364 

<212> DMA 

<213> Homo sapiens 

<4Q0> 37 

acccacgcg- ccgiiccc-ga agcaatg-ca accctaccag ccaagcacat cacttag^cc SO 

ccaccgcgag atgagggaca tgtgctiaaa t-gtgaaaca aacacaegag tcagg-at" 120 

ttccttcgag tccaagcggt tcacgacccc ctcucccgzg tccc-cguat: acgcgggagt 130 

fcttacaatct cttatcaaga atgaaagg-cc ggcccgtgnc cccactgg-g caggctgcca 240 

catttctctc tgttgcccag ccagg-gctia cggcacgtgc tgcrtccggc gtagtigtacc 3 00 

ctgtggacgc accagcangt tctccaaggc cacgaccgat cctccagacc ttcggaactg 350 

agataaacgc taaa"~g-a gcca-ctccg aacctcctcc agatac"t£ cctcat"gt 420 

CtgtuCgtag ggtaaacaca cctgacagca gcaacccaag cacaccctta gaacgaccac 430 

gtacggccag ^gcacctgaa cgtgcgttcc aagg-aggga acccaggaac ggccaactcg 540 

.gagactcacc cctcactatg acaaatacci gagccccccg c-catcccgc ggaacatggg S00 

CTitaC^gggg attaaggccc cgagctccag g-caaatgaa ggttaccaga tggaggCcac 650 

tagggggagg gtgttaaacg aaaatgcttc acaaacngca tgccgtctgc aagcagtcgc 720 

agtcczcctg cccagcccgc agccactggc cacgcagcca tgtcg^ccag cccgccgcca 73Q 

ccggaccatc tctgcacaca aggcagttcc cccgcccgcc tgccaccag- cctccaccct 340 

ctccccacac g-aagcccct agtaaacccc atgcctcaut tgctgccucc gggccccccc 900 

ttcagcccct: tgaacctagc gcczcccciig ctgaggntaa tzaggggtaca gcacaacagt 960 

gtcgtaacac agaaagcgac acccacaggg acacctcccn cacaacacc" cccaggaaag 1020 

g-aaataaaa tgcccacaac ccgtaggcga g-aaicccct: agacaagtcg tcccttaacr. 1030 

cgggaggagr ttgggaagga accnaagcag gccgcagagg ctgggcacgg gagcttgtca 1140 

tggccggaag tcgaaatggt caactccagg cagacctccc ggggcaaagc agcc-ccacc 1200 

accagcagcc czccccccct gccctccaca ccccaccgct cca:ctgaag cc-gaacccc 1260 

ttccagaaaa cngacggaca gatscctcet cccggccaca cacagcccta gaggtcagaa 1320 

ccagatataa tccactctag aagatctccc cctccccaga aacgaicgtc tctgtgcaaa 1330 

gccccgccaa aatagtacgg agacttagac cgag—cact catcaccaac aaccaacctc 1440 

acaaacantc aacaagcagg acaaccat-a tt:ac;g;cac ccagaacccc tcgccccgta 1500 

tatacagute gacc^aagac gccacatcca catggcaCic ccaacc'Cca aactccagca 1550 

gatittaaaa ctaggtggat gaaaacagaa ccattctaat aaacgtagtg tgccagacct 1620 

gaaaaatcat ctggtgagca ggacctccgt aaagttatac gggccacgca cacaagacg- 1630 

aactgaagaa aattaaccca acagagcatg ccgtactcga acgacacaga gactcactcg 1740 

aactgaacta actcaagctg cagaac-ccg agcaagcctg gattgtaaaa gtccgggtga 1300 

aaatagacgg agtacgccc- gactgaaccc ccgcaccgcc ccacatgccc atacaggcgg 1350 

gggattggat ggctgccagg tgaccattgc attc-cc ttc ggatccctan egagaagaaa 192 0 

tgataagaga gggaaaggat atggggcaag aacagtccga aaaagaaagg ataaagticcc 1980 

cagactccct ccacacccca agaagaactt tctgaaaagc ttggattagg tctggcaacg 2040 

gacataacaa gcaaaggact cctggaacgc gttcttggct ctcagcccca cctctgac;- 2100 

tgagcaaatc agctgattcc tccgcccgca aaataacagc ccccctgata icaacactca 2160 

ccccatgagg ttattcagag gatag-gtcg g-aacaatgc cccgcgtcca catcatccc; 2220 

tccacagaga gctcaaagca ctccaca-gc a-Cgagagag aagctcctcg tgaagagtaa 2230 

acagaagcgt Ccacccttcg gaaacgaac- taggccacaa gagcctgaac ttaatgcac; 2340 

gcaggaagaa atacggtaca cagtgaacca gtgggtcaac cgaacccccz g- nccaccaa 240Q 

gagticccccc ctggccccct: gtcccgtgaa ttgaggaaea eggtgagtcc ctacacccgg 2460 

atgggaaacc ccacacacgc aaccggaacg g-ctcccacg acacacgcag agactgaaga 2520 

acagtccgga cactt-c-ga eaacgcccut: cgggcctcgg iag-agc_ga aagacacctg 2580 

agaaa-ccta gc-cagagct acagaacgac actaacggat cccagaaaca gaaacgtaga 2540 

tgtiggagcgt tntatctgrc catcccaccc caacccaacc aacaccccct gagrgcctiic 27Q0 

Catacacacg atECugagcg acgtggagaa ccaaaagagc cccacacgct caggaaagt:- 275,0 

aaccccggcc ccagcaagga aaagaagtag gacccccaaa cagacaagtg caccgggtat 2320 

gtggaagt-c agaaaagctu cccagta:tc cagcccatcc acttggccac tctcaaccac 28 30 

gtatcactca Cgcaccaagc ag-atgccgc Ccacagagag acccaacctc cgccttaggg 2940 

atccctgggg aaaacacgta caaagagaca gtCCCagcac ac~c-agtaa aggcagtigac 300Q 

caagggcacc tagccccacg tggcaacCua gggaaggtac cccggaggac gagacctictc 3060 

ccaaagtctt aagaactgaa aagaacacgg gaagggaatt ccaggccggg agagnagnac 3120 

gcccacacgc cc^cagtgcc taaccctct; cgaacaaaaa aacggccaac tacagaaagt 3130 

ccggtcctac tg-agcctaa aCCgcaccoa gggg-acgag tgagaaacag ggaccaagac 3240 



24 



aaaggacctg t£"gctgcc tmgtttactg ttzgaacag-a g-atgaagta ggtcccgaaa 3300 

aactatgCm ctggggaaaa aaaaaaaaag actgaaigat: acgtcggg~t: taagtcc-cg 3 3 6G 

caggcaggc- acccaggtaa acaaacacgg aaggtgaCgg gagg~aaccc gggcuggaaa 3 420 

tacagattcg gaagtcaccc cataccagtg g-gtttaaaa Ccaagagcaa atgaaattgc 3 43Q 

acaaggagaa Cacacagaac gaacaaacta ccatgggtga agccntgagt aa~acagaca 3 540 

ttcaagaagc aaacaaaaga caaggaaccc atgagggaga c~ggaaagga aaaaacagag 3SQQ 

aaaizaagaaa aaaegagagg agagaaccga cacactttcc tcaggcg-gg cattazggag 3 660 

tccactgg-g tctcaccaga gaagttccag tccagtggcc agggcagaat gaca"gtg- 3 720 

GUtgttteaa agraaacggg tiagggtaaga aag~:gagaa ggg~oagcac aaaccnctcz 3730 

C-caagtcac t-gccccaga agagaaggaa gggzatggtc tctgggg-gc aacccagg" 3 340 

caagaagcaa aaaaaaaaaa aaaa 3 3S4 



<21G> 33 

<21I> 1411 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (1395) 

<223> n equals a, eg, or c 
<22Q> 

<221> SITE 
<222> (1397) 

<223> a equals a,t,g, or c 
<220> 

<221> SITS 
<222> (1401) 

<223> a equals a,u,g, or c 
<220> 

<221> SITE 
<222> (1403) 

<223> n equals a,t,g, or c 
<40Q> 33 

ccggcccgga a^tccegggo cgacccacgc g~ccggcgtg aaccaccgog cctggccgga SO 
agtc-ctaaa aaacaaagog atcccacccC CcCaagcota cagagaccag accaggegaa 120 
tgtaactggg gaaaaccaag acggeacctc Cccgcactai; cccgccagac acCgtaccoc 13Q 
atg.cactcac gectaggaca cagrgegaaa actaaaaagc tcagagggca gatgeaa-cg 240 
Cggcaagtga ccogccaaca aagcaggtgc agecaoagaa gccggcacag g^atatcooc 3 GO 

aacggtgctic tccccctggg cttgtctttc cgotgttcto t-cccctaoa t*cagagc:c 3 SO 

ctCgagaagc gataaacacc cccagct.ct:c caacaccocc cccaoaccat cccaccacac 420 
ccacctccca gcatccacce gcactcagcc aagggcggga aaccgaccca gtgcccgngt: 43Q 
CgcagaccaC CCc~gagg~c cccaccatcc aaggaggcac ageegtea" accg^cctcc 540 
acgcctccag cagccccccc cacagcoaag gtacaoacca ccccccc-gc cgcgcc-cca SQ0 
cccctggcac caaggtcctic tgccgcttac gtctaaaggg atcacccata t-taac-gee 660 
Ccagcgaccc aacctctttc ctcccatgtg ccagatgnta agacgaagga ggaacacmac 720 
acaiactcaa gccccagcco grtcagtege ttccactggg gczcgczzt' ccgggacggc 730 
aiitcatcatc agact:ggcaa gcctaacccc acaggoctac aggaag.agg gacacoctta 340 
taaaacaacc gtgtcccccc cacactczgc cacg—aata tcGgctinoca acaatttgea 900 
gctgtcccac ccc"ccccca tctgtccc-a ag-tgaaggc tetgeeggag gggacagagc 950 
acaggaacag ccotgacag- ccgtaat:ac cgtiacagaca t:ctaa;agc atacaaataa 1020 
gtatatccct tccatcicga aacaaaaacg atcagacac- gcocctcgcg tgeotgenge 1030 
cxgtggcatc cc::t;taaa aagactg-ca catattaaaa tagegcaca" acacaaacac 1140 
tacctccccc gctgtacag- tgegaiagag actgaagac- tcaCttuccg cgcgctcccc 1200 
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ataagaaaaa aatcaa^iaca ctaaagaacc tzgccgacgt gattgcaacg tacccacg-a 

acttatccac tctcgaacgt tccticcgtac c-ttaaacct tccatCaaaC. aaggtc'caa 

aaactcaaaa aaaaaaaaaa aaaaaaaaag ggsggccsc- ycaraggacc caascccgcg 

t'acgcg-gca acgaaancag ngtcgagngg t 



<21Q> 39 

<2il> 1132 

<212> DNA 

<213> Soma sapiens 



<22C> 

<221> SITE 
<222> (496) 

<223> a equals a,C,g, or c 
<22Q> 

<2 21> SITS 
<222> (1152) 

<223> n equals a,t,g, or c 



<40Q> 39 

.ctaagctggt: tcgcccgcag gcaccggtcc ggaactcccg ggtcgaccca cgcg-ccgac SO 

aagcacccat g-ctctgaac acgaacgtat tcgtaaaaca ccacg-ccca tgtgcgaaca 120 

tgtgctctta cagtacaeag tgctatcgtg caacaggcct tacgccg^cc tcacccaacg 130 

tgtgctaaga cctagcccca tcgacccttc cagaaacgca gtatcgcttc gacctgccat 240 

3SQ 
42 Q 
430 
540 



gtggcactcc acaatgtcaa tegcagt"a cacaca:cgc ccaaagtggg ggacacc-gg 
gtgccccwga ccccctggca ccggaiacag gccacgataa acacccutnc gtgcgtitccc 
ttctgcgccc grg-ggcatg cgcacccagg aCgggcctai aggtcacaga ggccag"tc 
tctccggccc cccagacctc cctcagaacg gtgactgacc cccccactcg aggccgcccc 
ttctccctat cctcgncagc acttgtatcg ccacactacc eaatctitcgc cagtctcacg 
ggtagatag- ggtscagcgc ttcamcaiac actcatctga tcagcattaa tttggggaa: SO0 
tttttcac;t agcctccccg gtteceetec ctgtgcattg cccactttcc cacggagcct SS0 
cctatctccc ttggttcailC ctcaggagtc gctcgtaca- cctcgggcaa tngcagaiaa 
tcccaagaat gcacaccegg gc-gggcacg gagg-ccacc ggtaatccca gcaccccggg 
aggsccaggc agaaggaccg ctgcagccca ggagttcgag actagcccgg gcaacacagc 
gagacctcgc ctctacaaaa aaaaattiaaa aagggggcct tgggaggcca aggcgggcag 
atcatgaggg caggagactg agacccccct ggccaacacg gtgaaacccc gtotccacca 
aaatzacaaaa aaCtagctgg gca=ggcggc gcacacctgt ag-cccagcc ac-ctggagg 
ctgaggcagg ggaaccgcct aaacccagga ggcggagatc gcagcgagcc aaggccccac 
cactgcaccc cagcctggcg acagagcaag gccccacica aaaiaaaaaaa aaaaagggcg 
gccgctctag aggacccaag cntacgtacg cgcgcaigcg ac 11S2 



720 
730 
S40 
9Q0 
:a 9S0 
1020 
1QS0 
1140 



<21G> 40 

<21i> 2457 

<2I2> DNA 

<213> Hamo sapiens 

<220> 

<221> SITE 

<222> (1S22) 

<223> n equals a,-,g, or c 



<22Q> 

<221> SITE 

<222> (1713) 

<223> a equals a,t,g, or c 
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<4Q0> 40 

ctcgacccac ccatccgccc acgcg~ccga tcataataaa aagtatcacc aggacagg~a SO 

aactcaccca tctaccgtga gccgctggct tggagaacga ccttcgagga gccagcctcc 120 

tgaggagaaa tgcctccata aagcac-cac tgctttgtaa aaggagacaa aactgatcci: 130 

aaatgaccac tccagggrtg ctgatt:ctgc tcctggccca tgtc^gcc^a gcaaaccacc 240 

agcaagc-gc tgaaccaggc tggaaacaac aztgccgcaa ctgggaggga catagagtac 500 

tg-gaaagca gcgtcccaac acatctccac cctcacaaag ggacaggcag agacucccaa 3 60 

catagaggtc cttaaacttc gaggggt'ac aatcccctcg caaacatttt tatagctacg 420 

gaccctcccc tcagacctac agccaacatc tccagcgcac cctaggaggt cactttagtc 430 

cacgcccc-g aaaaagccgc actccaacgg ccgaagagca agccatacgg ccgagaatgg 540 

ggctccctct gccrccacga aagctactitc ccccaacacc aagacccagc cgzacgttcg SOQ 

cttagctcag ccacattcac atccttctgg gtgaactgta ccttttgag- acc-gcctto SSO 

actttctaga atcagaccca acaaggzcag tagaagccng ggcagcagcg ggcccggaaa 720 

gacgaggcag ccagcacgaa ccgccg-Ccc ccccctgacc acaaggcgrc gtctccctcc 73 0 

aggg-ccaagc aatcg-ccct gttccgcctic acaggaatg- ggggaggaag gacg-caaca 340 

ccataaaatg ctgcgtccta cct.caaact.c g-actgcgag aaagc^ggaa acctccaccn 90 0 

gcacagcggg cctggtttgc acgctcagtc ccacttgcgg gaaccgcccg tccaagagtg 960 

agctcaggtc agggcagntt g-gcc~acga gaattagctc agccatcagg caggttttca 1020 

■ gacaccctcc taaaatgcgc ccgcgzttgg ttcgtcttgc tcac-gccag tcctzgggcca 1080 

aicaaaggcc igtaaggg-g agaactttca tgcacngcta caccgcaagt gcizcaaaaca 1140 

gagactggcc cagatgaggc attcctcaaa tgn-cgttga aatgaatgga caaactc.ca 120 0 

ggataaaccc caatctgccg gcaaccgcta ttcgactcta gaaggcaaac tgaCCtzacc 12SQ 

ttagagaggg gaaggggagg ggaggc~cat cagccccctg gcagaaagag gactac::ct 1320 

gcaaacgaat aggcctccac cttaagtagt gacagccccc aacttcccat tatggagtga 1330 

g-ctcgaccg ctctccaagc ccaacagaag ticaagacig cctctcagtg a-cagggaaa 144Q 

ttgaagcttc taaagctccc ggtctcagta ac.tcct.caga acaaacccc- tcaaaaggga 1500 
tattgatgga aatgtacaat caccagtaac cgaggntcta tctgagggga tggagatgat 
gaaatggccc ctccttggaa g~tg"ggca ccttggcctc accttccaca aataaagcga 

anccacutaa aacgatcgac aacgactata tagcgccatg cggaatacaa tagatattaa 1530 

tctgtggticg gtttccctgc ccgccttaaa tgnaatgtat catgtttccg ggtcccctct 1740 

ttagctgcaa aaatactccg tcaccaaagc atgaaatcca atcagcagtt gttcttcaag 13Q0 

tccccgaaag ctacarragc tcctcacgac ttgag-ggct tcttccctgc ccaccagagg 13S0 
agaaagcccc cgtagaattc tgcagtgcna caag-g~tcc ctacaaaaac cgaaaccatc 
agctcctctc taacaagttg gctctctaaa agcacgcaac tacaattcaa zggcactctg 
taaagtgg-g ccccaggcat aacctaaatc ccttntaacg accacacccc ctcaaaacct 
tgaaagaaaa atg^g-tcct ctcgczgcac cccttgtaag aagactgcca acagaggaaa 

aaggactt:a caaaccaaga ccaccctggc ctcacctcca caaaga-gag aacaaatcat 21S0 



ggtgtcagga aaggatccct agaagaacac aagaacctga aagcccctgg tggtcatcac 



<4Q0> 41 

gaccccntag ctaagcacac aagg-aactg gccccgcct- gctg-n-tgc ctcacc.gct 
aacgtgcaca tgctctgcca cgtgzacccg ctgctagtgg gacacgccga mcccccagcg 
ggcctcacgg gccagaggaa gutgcg—gt. tccataaact cagccctcag aagngctgcc 
tcctct-gccc ggaatccgcc cacctgctcc aacacctgaa tgccgctcat cccccaagac 



15S0 
1S2Q 



1920 
1930 
2040 
210 0 



2220 



2340 
2400 



cactatactt cacacctcca cagaagcgac ttagccaagc cccgcattcc gagcccgccg 2280 
actttcatcc aaaaggaacg aaaggccgaa aatccaggcc gctgcgtct.g tagataaagg 
tcaaaccacg ttcgagcccc ccactgtcgt gcccacctaa ataaaactiga gcaagtaatg 
aaaaaaaaaa aaaaaaaact cgaggccgac ggcatcgaca agcttgacat cgaaat; 2457 



<210> 41 

<2H> 1S47 

<212> DNA 

<213> Homo sapiens 

<22Q> 

<221> SITS 

<222> (1279) 

<223> a acuals a,t,g, 



SO 
12Q 
130 
240 



27 



tcaat~ctgg tggcacct:ca g-tcngaccc tcccccagtc Hzzczzzcac tnc-ccccac 300 

aggccgcccc ccc~"gacac cccgcccgcc tgcuacccai g-ccccccct: cccggcgaca 3 50 

gtgctacagc ccc-cctcgg ggaaccaacc ttzgnccgaga g-ggacagaa atcacccacc 420 

cctccaccca ggga-gaacc aatgacctgg ccaaccacag ccgc~g-gac tggtcc-ggg 480 

acaggcagga gactcaagc- aggccacaga gcaccagtigc tgagactgaa ac-aciggga 540 

aaacaaCacg c-cztzczgc ctgaggcgcc tag-ggatzac aacggaagcc t~gaggcagc 600 

tgaccatcct tgccttgaga gaggcggctg cccaagaagg aagccaadcg agaaaaagca SSO 

ggcagagaaa aaaaacagga acagacacaa caggcaagac tcagctgac" tctcacgcca 720 

gcaaacacaa gagaaggaga c-ggtcaaga cagagatgtc cgaegacact gagctcccga 730 

ctccagccac "cccgaagtc attcc-aagc cccccgctac acaagccaac acacgacc-t 340 

gktagaccaa aaacagcctg a.czzzqgzqz Cgccacccgc cgnagaaagc acycacgatg 900 

atacactctiy ccgtgcgcty ccacacccta attcaaccty ctycacagca ctgactcgga 9 SO 

ggccctacga gac-gkgaac tctycgaggg caggaaatgc atctcactcg cccccaaggc 1Q20 

cacgagacct ggcgcacgat agacatccaa tacaggtrcg tgaaacaaaa aa-gaaggaa LQ8Q 

tgttcaccta gaaatcaacg aagccctnca Cccac-ccct Cac--caggg cccccctigcc 1140 

aagaactctc aagatgcctc aggatctcgg t-gcagccag gcggccg-cc tccccagagc L2Q0 

ccctgccgtc agccctccga agtcactggw cctggagaca acagggaggg caggncccga 1250 

ctgctgagaa agtccaggac cccggacccc accactiacca- cgctacccgc ccactigggc 13 20 
ttcacccaag gccacaccca tccccgcacc cctccccaac ag-gg-gagg ggcaggagca 
cctggaca-c aagaa-cgag tgcatgcctg aacctgrcca tcacccaCgc ttctgcagc- 
ttgcccatgc Cicccccccc ccgaaacgc" cctcccttgi: ccacctaatt: actcaaagcs, 

cagaggaaaa aaaaCcacct gccccaaaac gizcCcccccg aaccczgcag ggaaccaaac 1550 

tcaaaagcca ccagaaggcc gaggcgggcg gaccactcga ggecaggagc ccgagatcgg 1620 

cctggccaac acggtgaaac cctgcctcca ctaaaaacac aaaaa"tagc caggcaeggn 1530 
ggcaggtgcc cgtaacccca gcaactccgg aggzgacgca ggagaacigt tcgaacccag 
gaggcagagg tcgcagcgaa cccagaiitgc accactgcac tccagcccgg gaaacagagc 

aagaccaggc taaaaaaaaa aaaaaaaaaa aaaaaaaaaa actcgag 1347 



133C 
1440 
1500 



1740 
1SQQ 



<210> 42 
<21L> 2597 
<212> DNA 
<213> Homo sapians 

<4QQ> 42 

ggcacgaggt tacacctcac cccctacctc cgctgacrcc agaccccaga ggc.gaggga 
cagtagacac gcccrcctgcc attgccccti cccttgtgca egtccgag-g gcgccagccc 120 

iao 

240 
300 
350 
420 



SO 



gatcagaetc tgttgcagte ccccg-gctt gccacaatgc cggccgcgct tcttacccaa 
cgccgtttct: gcccccggtg g- tcgngcta aaCgca^Ctc caatcccaag tggtacagag 
aagaaaagga ccgccc-caa aaagcggc-c tgaaeg-gtc gaaagcaagg ccgcccgc-g 
tgtgcttttg acaaccgccC ccccagagga gcctctgaaa acctcactcc tcaagacgcc 
tcaaccccta tcagaaagr:g acctaaataa aaccgccctc ccgaccatcg cggcnctgga 

tgccaggc"a gcgaGtacs.c gcccaacccg agtictccagg gctagaaccc cacaccgtgt 430 

cttttcacta ccctcccaga ggagg-gaga ggaggtgaaz agacacctgr. cagacccrtc 540 

tgcctggtgt caccgtgggg ggacgcacac acaccggagn acaggctzgc taaagagaga SQQ 

cCgcattgct acctcaacta gacacgttca cttiCcccgc aaa:gac;ac tgagcaccca 550 

tggtgtgccc gg-accatgt tagggcaaga gacctacccc acccctgcac tg-aaacccg 720 
tagcacccgt: ggcaatgttc cacaccac;g ccctcccagg gticctgcaa gcngacrccr 
zzza.zzzzcc tcgcgcacgc tigacgtg-ccc aaggcaetcc agcgggccac tgcgagacca 

zc-cccca.cc aggacctc-c aacccnggca ctaggaacat g-cgggaaaa g-agg-ccc" 9Q0 

gtcaagggag gaagaggggc ztzcczgzqc gcgataggat gctcggcagc atccc^gncc 560 

ageagcacca ca.Zcza.cczc agctrg-gaca accaaaaatg tctgcagaca tcgccaaatg 1020 

zczcczcccz acttcacatc taccaaagtg c~ct-cacg- ccnccacagt caaggaaacr 1030 

taggaccagg aagaaactcg ctoaagccaa' acag-gag-c ggcggtagag cctcagctac 1140 

aatctaagcc zzqzczzccc aacccagtgt ccatcctacg gacccatcca gtggcctgcc 1200 

cccaaatgcc caacaaccaa tca^ctggga caaacgaaag ccaaacaaac ttgcngggag 12S0 

cgtgggacaa -ttccacggc gnaaacact." ccaccacggc tgg-ctcaag ctacccccat 13 20 

ggcgccacrg aatg-Sgaag- cgggaaggga tccgtacagc cggacccggc gagcuggcgc 13 30 

cagccagccc agaacgcccc caagtccacg Cgngcugccg atcacacatc agtcccgccc 1440 



73Q 
340 



23 



tcctctggac ctcgcccrgc caccccccag acacacgccc ccatcaccgt gaeg-gccg- 1300 

ggggaaagca gtcaccCggc aagag-tccc gataactccc ccggacaaac acaaagcagr 15 60 

tagag~ggag ccaccccccn crcccgnzaa atgaaagccc cccctcacag tcectccatt 1S2Q 

gaaggaatta gaaaag~cat gggatacccc tggtccdaga gcagaaacga agttccccag 1S3Q 

aagggggtcc cggacagaae ggncccccgg gcagggacca gacgccgtca cctaccacca 1740 

gatgtgccna ccgtaagacg cttgggaggc atcaaagcgg tgacag-cggg gggctg-tgg 13Q0 

attttacca- tctggaacat aagagaaggg gt:g-ggccgg cg-ctceagcc cagacagcag 13S0 

ctgtgaccac tgggagcccc acuccccaca cctgcaaaag gagaaacagc gatccccccc 1920 

ctagtgtcgg tgraaagagc agacgaaccc ccgcctgagT: cccagaCgcc cgctgcccac 1930 

ttggtcgcct ag-ggaccgg ctggccgtca acagtgcaag cttatcaacc ctaaatacct 2Q4Q 

agttgg-cct gac^ctccca cctcgaccac gac-cagng^ cacgzgcucc eagccatccc 2100 

gtctgtgg-c caacgceagc cngggaagca ccngggaccg agggaagaac ccccagccag 21S0 

Ctactcagcg acccaggtcc tcczcczgcc cczgatatca cctcatcata caaccctggg 2220 

aaacacutcg gcgigactgg aacgnggata cccccagggg aagggcagga gcacggtagg 2230 

gcacctggac eccatcatga aggcggtagg aaatactcga agggzcctaa gcagggacga 2340 

cacatcatca aatgcgcgtc ttgaaaact; cttcctgcag aggacagggc cacagcccaa 2400 

gcaggatcaa accagccagg agattgatg- aacagtcccg tcagaaagcg atgaggcagc 24S0 

tgggtgcggt: ggcccaCgcc tgcaa-ccca gucaccgcgg gaggctgaga ciggcagaca 2520 

gtttgagacc agcccttggc aacacggt:ga aacccctC:; ccacagaaaa aaaaaaaaaa 2530 

aaaaaaaaaa aaaaaaa 25 97 



<21Q> 43 

<211> 3116 

<212> DMA 

<.213> Homo sapians 



<4Q0> 43 

ggcgataacg aaagtgg-gg tgg-gatggc gg-aacactg g-ggtggcga cat-ggtggc SO 

ggtgg-gatig gtggtgacac tgg-gatiggt ggcgatrggtg gcggcgctgg tgaccctgac 120 

acggggtcca gtagcagtga cagcggatgc aggcticccgg tgactgagga gca:ctcagg 130 

cegrggaggc acctccgatc cccgccactg ctccccaccc cctacagcc* cccagcaaac 240 

ctgctgggcg acagcggact cagatgcccc ctggaatgtc Cgccgcagtg cccacctccg 3 00 

gttcgctcga tccgagccac aacagcatcc cccaggaaag tigccccgcac ccgctggaga 3 SO 

cactgccccc ccgcccacgc gtccgggagg cctcagtgaa cctgggcccc gagcagagcz 420 

tccggaccca ctcccccaga gaggaccagg ccgggaagac actcaggcta agtgagtgca 430 

gctcccggcc agagcacg-g tccaggctgg ccaccggc&t gagcaagtcc ccgcagctga 540 

cggagcncac gctgacccag tgctgcctgg gccagaagca gcCggccacc ctcccgagct SOQ 

tggcggggcg acccgcaggg ctgttcagcc Ccagggtgca ggagccgtgg gcggacagag 660 

ccaggg-tct ctccctgtca gaagtctgcg cccaggcctc aggcagtglc accgaaacca 720 

gcatccccga gacccagcag cagcticcgcg tccagctgga acctccccgc caggaagaga 730 

atccagaagc Egcggcactc aggccggccc actgtgaccc tggagcccac cacagccctc 3 40 

ccgycgggca gccgatggag acatgcgcca ggctgcrgca gctcagcttg tctcaggeta 900 

acctctgtga ggacgatgat gccagctccc tgccgctgca gagcctcctg c'zgzccczct 960 

c-gagctgaa gacaititcgg ctgaccccca gctgegtgag caccgagggc czcgccc3.cc 1Q20 

tggcatccgg Cc^gggccac Cgccaccact cggaggagcn ggacttgncc aacaaccaac 1030 

utgacgagga gggcaccaag gcgcCgatga gggccctcga ggggaaacgg atgccaaaga 1140 

ggctggaccc cagccacctc cngctgaaca gctccacccc ggccctgctc acCcacagac 1200 

Saagccagac gacccgccng cagagcccca gacegaacag gaacagcacc ggtgacgccg 12 SO 

gtcgctgcca ccttcctgag gctcecaggg ctgccaccag cc~agaggag c-ggactitga 13 20 

gccacaacca gattggagac gctggtgtcc agcactcagc taccaccctig cctgggctgc 13 30 

cagagcccag gaagatagac ctctcaggga atagcaccag czcagccggg ggagcgcag- 1440 

tggcagagtc tctcgtecct cgcaggcgcc Cggaggagtc gatgcttggc tgcaacgccc 1500 

tgggggaccc cacagccctg gggccggccc aggagccgcc ccagcacctg agggticccac 15 SO 

acctaccact cagccatccg ggcccaggcg gcgccccgag cccggccagg ccccggaccg 1S20 

atccccccac tcggaagaga ccagcctggc ggaaaacaac cnggctggag gggtcctgcg 1S30 

CCtcngcacg gagcccccgc ^gctcagaca gatagacctg g- ntcctgca agacngacaa 1740 

ccagac-gcc aagctcctica cctccagccc cacgagctgc cczqccczgq aagtaaticnc 1800 

gctgtcctgg aacccccucg gggatgaggc agctgccgag cnggcccagg tgccgccgca 13SQ 



29 



gatgggccgg ctgaagagag tggaccngga gaagaa~cag accacagcct tgggggccng 1220 

gctccrggct gaaggaccgg cccaggggtc tagcacccaa gccacccgcc cctggaa_aa 1980 

ccccaccccc tgcgacacgg cccagcaccc gaagagccag gagcccaggc iggact-cgc 2040 

.cttctutgac aaccagcccc aggccccczg gggcacizcga Cggccccccc aagacc-ctg 2 ICQ 

gaa'cccagcc aagtgatgca cccaaatgat- ccaccctccg cccaczggga caactgaccc 2160 

aggaaagaag agcctcggca gggcgc~ctg ca.czcca.ccc aggaggaagg atacg-gngt 222 3 

cctgctgcag tccucaggga gaactcctt- gggaaccagg agctgggccc ggacaaagga 2230 

gcaccctgca ttacgcggga cacgcgtgat: caaccgggga catgcgacac acaacgaggg 2340 

tgtcatgaca acgcacgaca cgcacggcca tatgcggcag tgizgaccccc tgacacgtgg 2400 

cgtcacacga aagncagtgt ggcacgcgcc ctgnggcatg ggcgccggca ecccaagtag 246Q 

caggatacac gatig- tggt: ctatatacga cacaiigacaa atgtccatgn cacagcac-c 2520 

acggccggcc agacgacccc aggccggccc aagatctaat ccatcaatc- ccaaagcaaa 2 530 

Cacataccca tagactgtgi: gtatggagca gccaagncag gaaaagtc-n ccgcccgagc 2540 

tgggagggga gagtgnccac gcaccgacca gnccaggggc :caagggc:a gggccccgga 2700 

acaagccagg gactcagcca Ctaagccccc tcctgcctca accctcagcc cacccaccca 27SQ 

taaacccgat gactcccccc tcacecacac ac~agcct:cc aaggacaggt: ggaggcaggg 2320 

ccagcctggc gggagtggag aagcccagrc Cgtcccacgc aagggacaaa gccaggtcta 2 330 

atggtac-gg gnagggggca ccgccaagac aacaagccag gctaccgggt ccagctacca 2 940 

ctttggcggg actcagg-ga gtccccatgc acc-cacacg ccacccagtg tccccgciac 3 0 00 

ttccaaggag aaccaagaat ggcuCCgnca c'acccgaagc caggcttgat caataaacac 3 0 SO 

aacgg-accc caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa ctcgag- - 3 LIS 



<21Q> 44 

<211> 24o0 

<212> DNA 

<213> Homo sapiens 

<4Q0> 44 

acgcgticcga tgaactcgac cggccaaatg ccccs.cr.ttc ccagacggc" tctaacgact SO 

Ctcaggatca gggccagccg cggguccact cc-cgt"gga gcccaictca cctgggacgc 120 

ctgcagccag ccctccctcg tgatctgtcc caccctgagti aggagacatg czzczccccZ 130 

aacct-CtCcc t^tccgccac aaccaacaca tgtcctttzc agtaagtcca tgcctctggc 240 

aggggaCgaa gaagtactca ccgg~aaeca gciiacca-cc ctgcagcagc cctzggcaacc 2 00 

tgaaaaat-- gggcccggtg ccgtccatga guccccg-gt aaccgcaaaa gcaggaaagg 3 SO 

aagtcaagac ccccgtcgcc ccg^gctcag caaagcagcc czza.zcczzz acaciccgct 420 

cCUgggtzci: gc-cttgtcc tgttccatac caggcaaac- gctcagcagc aaagggacca 430 

aactgaaaag gtgacaacc- ctaactecca aaagcagaca ccaaccggac gctcattaga 540 

ggttaacgaa gacgccatcc ttggtggcct ctgcacccaa actgcacctg gaaagaacca SQ0 

gggtctcact cagaacg-cc aaaaggaaaC tcctaagagc ttaaacccag aCCtgtgccC SS0 

cattaaCgca gtgaacaacc caaaaccaca cagatccccc ggcaggaagg aeaatggaac 720 

aacagng-tg acgagacctc tccagcttca aggticccgga gcccaaacaa atggatgacc 730 

cattiggaa- gaaacccaca acgcaagtrag aaggaccucc ccaaaccagg ccagt-ggg- 640 

Catcctggc- cggaatccgg tgtgaaacca Cagg^citaa cacnctggag cagcacatcg 900 

ctgcgga&ar g-ccaggaga ccttagatac ggc"aaagg ctstcaagac gaggacagaa 9S0 

actgctcaca accgcccagt: Ctcccaacag aaagac-cac aagagtgcca gcatggggta 1020 

catggag~ga agctgggtgg gaagcaccac cngcacagtc cctg"cccag tgcaggacc" 1030 

zzczczqzaz gttttcacac catgggatcc tcggacatca gcgcact-cg gcncccgaaa 1140 

cagccnaata gctgctcaca cactgggtag gaacaCcaca ccaacgtcaC ccccaaagga 12 0 0 

agggtgagct gaatggaaac caagcccagc cattteact- gatcna-nag czczgz~s.zc 12 SO 

agtgcacgac cacccagacc acccccccca gcccacacag Cgc-gaacca tcttccctcc 13 20 

cgtcccccac ggcizaceaat agtacagc-a aactcagagt: gcagagccag aCataagcat 1330 

tttggaacca ccccccagtc ;gcgg"agaa gcngaccgga atacaggttg agtacccctt 1440 

atccaaaatg c-agggacca gaaagg--£c agacctntcc agacttngga acacctaaca 1500 

gttgagcacc ccaaacctga aaggcticccg aacgccacgt cagcactcaa aaaagcggai 1SS0 

tttggagcac tccaaattcc ggacitttgg atctgggatg cccatcctgt gcaggagagg 1S20 

ctactcgatt cca^ttaacg accgcccnag tcataaccac ccaaagacaa aagccaggta 1S30 

gacgttgaaa gccctttcca gggctgaaaa agrgizcctta cgttccccgc atg-gaccag 1740 

catcaccctg gaaattaatg ctctg-ccct cacnagaatg Cagtaagtgg ctaaactgag 13QQ 



30 



cZaZccccca. cccgatgact attggcatcc acccgcaagg ccaacggccc ggaccaaggg I860 

tcaggat~ac ttg-agccag aagg"aaccc caccticcgcg aaactaaccg gcccataccc L92Q 

gaggntaggt guggccTitga ccZi3.cca.gz acacccacac ccactaccag CCgac~agcc 1930 

cagacaattg etaaatggcg ccccSuCCcc gczz.czca.qz agacctccac gccatxacaa 2040 

aggaaaeccg aatiacctag tgtcngtaca tcccacga^a accatg~ata acctctgtica 2IG0 

cacagcttac g-actgctaa cactcaagtg taaaccatgc cacagccaac acrcaaaaat 2160 

gaaaaccaat tagctcc-gc tcagggaaaa cgccaggcac gaag-atggc acacacttga 2220 

cactgtcccg Zgzaaccczz tacczcgccc aggctctcaa gaccgag-cn zzzzzccccc 2230 

aaatcagg-i: aacaigcact tgaccccaac ccgcggggcc tgagcaagci: ggaaacctgc 23 40 

gacggnaggc Zz.zcza.gzgz cacgaggcgg cggngaccga aggaaaagcc gggatcacag 2400 

gutccttccg acggagagga aggttcatct ctacgcccct cccaccaccc tccacctaga 2460 

gctcacccaa gcccgcccca gccccagggg caggccaccc cgcaaaagca ggacctcaca 2520 

gaaacaaggg ccgggctgag gtcacccccc ccagagctgg tccctggcca gatgggcaag 258Q 

aggcattcgt aatctcaaaa acgtgaaacc cggg-cccgg- gctctcttc" aagtgcc~aa 2640 

acaagcaagc caggccgceg ata:;ccacc cagagaaacc cgcaagccaa ga~taacccg 2700 

aatctgaagc ttagaatctc gagtccgcac ctgcatcaca tcangctgct ccgacgagga 2760 

aacatttgcc actgaggag- cggagggagg gcaagacgac agngt-aagt caga-catc: 2820 

aatggtctcc ccCaagccct ggaaaaaCac ccgaaagaac ggcagcaaaa aggtuaagaa 2330 

agcaagccag acttaccgca caacatgcag cacccagtac tac-ccaaac cccaagagaa 2940 

cagt-g^gacg cctaatatat acaggtccac gaaaatactg tggaacaagc ccaggaaggt: ' 3000 

Cagaiigtgcc tgcaaacaag ctgcccaaag ggtccccc-c taagcaaaac aaataetcag 3060 

acsacaggct tcaatgcaaa ccgncaaaaa gtgggacgcg gaggac-ttc gtcaagcgcc 3120 

aatcgaagzc aaaaagcaag ggtccttggc caggcgtggc gcccacgcct: g-aatcccag 3130 

cactttggga ggccgaggcc ggcaaatcac ccaaggtcag gagiccgaga ccagcc-ggc 3 240 

caacatggtg aaaccccgec cccaccaaaa acacaaaaaa aicagcccgg cguggtggca 3 3GQ 

agtgcctgra gtcccagcca cccgggaggc tgaggcagga gaaccgcccg aacccgggag 3 3 SO 

gtagaggttg cagtgagcca cgaccatgcc actgcactcc agcccgggca acagagcaag 3420 

actccatctc aaaaaaaaaa aaaaaaaggg cggccgcccc 3 460 



<210> 45 

<211> 2522 

<212> DNA 

<213> Homo sapisns 

<400> 45 

aggttccccc ctc^ggatac ttggcagtcg cact"atgc agetcagcca agaactaaac £0 

acaggccctc aggtgtggcc agagtacagt gggccagagg toctt"ctt; tctccatgtg 120 

gtcatacacc tctgtcaatt caactcaagc ctatattact tccgggggcg ggagtgggga ISO 

agaggatcac attgtcgacc Cgctggaagc attatgtcaa ccaaaatcce ccagctctca 240 

ctttcaccat gccgizuggg" tccccccatg tcgcctcctc tcaaaaacac caaacgcaga 3G0 

acticcctCC tacactgctc ccattccatc ttgttgacct gtagcccatc agagtatga; 3 60 

tccttaetgr caCccaaggC acCctgatat aaccaccgca tgcctaaact ccagtcaccg 420 

actaaacgcc acecgagacc attcccttgg Zzgzca.ca.cz acctcacagt cgaagaaacc 430 

gaacctcaga gaggctaagn accttgCtaa aggccacctg gccggaacac aacacccigg 540 

tcatetcaac caccaatgcc gtcatgaaag ggagtiiagcc agggcaggtg tgaacagagc 6Q0 

attccagcat aagcagccaa caatcagagc aacCgcg-a." tcccagTagcg Cggatagcaa 660 

accticgccg tcccggaagc -ac-atccca gaagcacggt: accaagccct ccatccaacg 720 

CggcgtCcac atcagtagca tcggtctcac ctgggagccc tctagaaatg cagagctcag 730 

gccccactcc aggcctaagg acccaatccg caictcaaga agcgacccac acacacatca 840 

aaggtccaag aagcaccgct gtagtatcct atCCCccaca cZZZa.gZca.Z Ctgcacaica 900 

cttg-gcaac cctggctcca zczgca.Za.cz ccccgaaggg ggcctgcccc tccacacccg 960 

tgggtatutc icgtcaggtg gagatgaaag aatgagaaaa gaaa:aagac acaaagtaca 1020 

gagaaagaac agtgggccca ggggaccggc acacccagca tgcgaggacc Ggcaccagtig 10 80 

ccggtc-ccg a.gztcczzca. gca" cratcg accaccazGc tzactgtccn ggcgaggcga 1140 

gtgtiagcagg gcaacaggtig gcgagaagg" cagcagggaa acg-gagcaa aggaacczgC 1200 

atcacgaata agttcaagga aaggtaczgc gcccggacgc gcacataggc cagac_"acg 1260 

tticactcta cacaaacacc taac^agcag agagcaacaa agcagcactg ctgccagcat 13 20 

atctcgcccc cagccacagg gcggticttcc cccatctcag aatagaacga acgggaatgg 1330 
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tcgactuCac actaagacat tccatcccca gcgacgagca ggagacagaa gcccccatc- 1440 

tacctcaac" gcaaagaggc ctccctcttc cac_actcc~ cctcagcaca gacccc^-ac 1500 

a-gtgtcggg ctgggggatg taaggnc-ct cctttcccac gaggccatac cccaggccg^ 1550 

c-cagtcggg ggaaacctgg acaacaccca ggcrectccg ggcaggggtc cctgcggccc 1620 

tccgcagcgc accgcgtcHc tggcuaatcg agaacggaga acggcgacgg ctcccaccaa 1530 

gcacaccgcc tgcaaacata GcgtGaccaa ggcacaccct gcacagccct aaacccacca 1740 

aacctigatc caacatagca cacgticnctg ggggcacaga grtggggcta aaccicacaga 1300 

ttaacagcac cccaaagcag. aacaacttcc c~tagcacag atcaaaaegg agttcc-cat 13S0 

gtctccctcc tctacgcaga cacaggaaca acctgacccc tct-ccccac acttcc^ata 1920 

tgccaacgta ctccaazcgi: ccgccagcc- ccatig-cccc ccaacctgaa acaaccccac 1930 

ttacaaaaca gccggaaaaa tagtaaagac aattcctigta tccsccncac ccacgt:a3t.c 2040 

ccaaatgtg- aacacccaca accagcaaaa atcagcacac caa"ac~ac~ acccaaccua 2100 

cagacctcac tcacactttg ccag^tgccc tgctggtgcg c-g-acg-ag cgcaggaccc 21SQ 

agtccaggcc cgnacgccgc acttigccgc cacgtctcci: cag-ct:cc~c tcatctcag- 2220 

ggtcrttccG tccccatgac tatgacagaa atgacc-g-a cccn-catac acctgcagtc 2230 

acatgaagcc tatttgtccc actactggng acgtaaatct cggtaactcg gccgagggtg 2240 

gtaccgtcca ggtttcccca tatt:aagccg ccaccttccc ctgcgcactc aacaagcacc 2400 

Ccatagggaa accctccgag actacgtiaaa taccatacca cacgcacccc gcctatctta 2460 

gcatcct"cg aagacaaccg cccgaaacat Ctctcaccg" gatgaectcc eaa-dctacc 2S2Q 

aCCctgcatc taccag^c-g gaatccgatc gtaaggaagc ataitccccc gticccccacc 2530 

aCuitactaa agaaatztac acccaaaaaa aaaaaaaaaa aa -- - 2622 



<210> 45 

<211> 1934 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (106) 

<222> n. equals a. t Z.g, or c 
<40Q> 46 



agtcgcdgct atgaaigatt ttaagccttC tcttttaagc ctggcaaaca ccccagccaa SO 

tcaaaacagt caiacccctg agaagcagga aactaaaac" tcctcncaca caattgtcag 12 0 

aaggttcgtt ccccaaacta ccatagccac aaagg-gaac agccaaactc caggaacaga 130 

accaaaagaa caaaaatctg Cgaagagacc ccaccactcc tcctcccatg tcccggcctc 240 

ggtccccact gtccctatca cttcagcaag tggaacagca gcaagccctt: cagtgcacat: 300 

gctgcacaag aacaaaaca" aaacccgta: ggcaccaaaa ancaaagtga aaaccaaacc 3 60 

aaaaacccaa acaccctacg ^aactatcgg aggcacanac gcggtacaaa egaccgcagc 420 

Cgtgacacac acatggctac ccgzcacasc acttlccaca attatceacC gcaaaatgac 430 

tgagaggcts tcgg^gcagg cagccgtcaa cctcctgctc cctttgttac ctccggatca 540 

ctittgcagta aactgcaggt cccccaagag aCtcaagccc cagtc;tctc aaaccaaacc 60 0 

aaccatcctg cctcacctga agacgcaggg tcg^gggcaa aagaggctgg tcalaacaac 660 

gcccncatac cgagtggtct gtaaacggct gcacacccca ggcaccgtag ttgccgaaga 72 Q 

cgc-ctgcta aacgngaccc tgaccggctc cacaggggtg cagaacgcaa cctacacaag 730 

gtgacntegc acctacctcg ctctcgaggc ggaCgaaatt gagaagctigg agcgcg~aag 340 

ccatgcacat aagtattccc caccgtaaac ctcg- fctcca c-c~naaccc aaccacggta 900 

ctttacccaa tgcacaaccg acctctcagc agatactcac ctgaaaacag cgnggccctg 960 

atcagtgaga aagggaagga gaaaagtgac tctuctgccz acgcagaaac gactcactcg 1020 

ctgagagccc gncttcctgc agcacccttg g-ataacgtc agcgatcggn ccccc&cccg 1030 

actggggaaa gttaaCgtct ccgaccccgg ag-taatcca gctgagtcat cttatacctt 1140 

CaggaagcaC cagaatcgcc ccgacgaaca acaaagccga ccgntctgac gtccaa-ccc 120 0 

aggttccaga atacagtrggt gtaaagtccc actatctcca attcttaaaa caactctaac 1250 

tccgtacacc ctaaaagtca cacgcacaag gccCg-ccag agagcagagc cuccatccct 1320 

ccgccccnnc tccactctgt actzcacctg raaaaatatc aagtgacttc acatkgrata 1330 

tccccatzgt aaccctgaca tctctcaaag acaaagcacc tctcgatcac gaaacacacg 144Q 

aaatcc-cgn gtigacgtgga tcacagtccc tcaggctccc tcagataacc gcctacgaat 1500 
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atcgticcraa c~cCgT:gt:aa gaagag^aga aatccttgc- aatgtcagaa gg-ctgcact 1SS0 

■ actgatccag aatgcactcc gctag-ctcc aacgga~ggg agagt:aaata acgccgcatc 1520 

cacaacttaa taagcuactc tcccttgacc cccaaggtaa cccutLCttt cccgrcaact LS3Q 

acagcac-ga agccatagna agtgaacgag accaccagtx cucagggctg gttccagag- 17 40 

actgtaaacc aactiagctgTi cc-cccaaag ag-o-acaact cccactzcagt acaccgga.a 1SQ0 

atgggcgtg-C gggtggggct ggggagggcg ggagacagtt tgtagaaaag aaaaaagaaa L3S0 

aaaaaaaaaa acacactxac cc~aagaaaa cacagacaaa aaaaaaaaaa aagggcggcc 1320 

gccctagagg atccaagccc acgcacgcgt: gcacgcgacg tcacagccct tcnacaggg- 1980 

cacc 1334 



<210> 47 

<2L1> 1337 

<212> DNA 

<213> Homo sapiens 



<220> 

<22L> SITS 
<222> (442) 

<223> a equals a, eg, or c 



<22Q> 

<221> SITE 
<222> (444) 

<223> n equals a,t,g, or c 
<220> 

<221> SITS 
<222> (473) 

<223> n equals a,C,g, or c 



<22G> 

<22L> SITS 

<222> (493) 

<223> n equals a,c,g, or c 
<220> 

<221> SITH 

<222> (1011) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 

<222> (1025) 

<223> n equals a, eg, or c 
<220> 

<221> SITE 

<222> (1111) 

<223> n equals a,t,g, or c 
<220> 

<221> SITS 

<222> (1119) 

<223> n equals a, eg, or c 



<220> 
<221> SITE 
<222> (1169) 



<223> n equals a,t,g, or c 
<22Q> 

<22I> SITE 
<222> (L234) 

<223> a equals a,t,g, or c 



<400> 47 

ggcacgagac 

ttcgacacgg 

ccaggccgga 

gctaagggcg 

cagtcaggcc 

tgactacccc 

tgatctggac 

ggggtccgtc 

tggagcgaac 

tccgagggtg 

gcattggccc 

accacttacg 

ggcaagccga 

accagcaacg 

cggixccccga 

grgcttcccg 

gcccgggcaa 

aagcr.ggaag 

ggggcaitcg 

atatgacccc 

tcagga-cca 

tcgcccggag 

tccigctgat 

agggagcaat 

agcaggccga 

tgcgccctcg 

ggccggggac 

gtactcaaaa 

ggtctatcgc 

gta'cgcccgt 

Ccaacaggcg 

gatcgccg-g 

tagggcccgc 

aaaaaaa 



cggcgccagt 

atggcgacgg 

acgaccctga 

gcagccaccc 

agtetcactg 

Cggggccagg 

ccgcacgact. 

gcggtcaacc 

gcucaaggca 

tcgccacgag 

icggcccgat: 

ccagccagag 

ctcaggaaac 

cccccgcagg 

cgccggtgac 

accgcaccgg 

acccacttcc 

titaagtaacg 

acatcagtcc 

acccccggcg 

aatgggctigg 

tagcgtgccc 

cagcccggta 

ctcgcaagaa 

aagcggcctc 

ccactcgacg 

gaaccccgna 

cccggggccg 

gcgacatcag 

gcgggcagcg 

agcagcacga 

gccccgcgcc 

acciacccca 



cctgcg-cgc 

Caa cgaacgc 

tcgaggtcct 

agcccaacgc 

cccacgaca- 

g-gaacgcgc 

Ctigaagcgaa 

ancnagaagc 

cccgggaaac 

agg-ccagtg 

gccgatcccg 

tgcggcutcg 

acgccaggcg 

tctcgaccgc 

cgccgacgtc 

ctctgcccac 

cac" tgcgca 

ccggcggccc 

ccggcgtggc 

acgagccgnc 

tgaaagacca 

atggg^cacc 

ctcccgctgc 

aaaatcaccg 

aggctcggcg 

gccgcgcgcg 

tcgaccg-ga 

ggaacgggct 

cgggcgccag 

ggcccggcga 

tgggcaagga 

gaactcggca 

tagcaccccc 



acagcaggac 

cccccccgxia 

ggtacccgcg 

aczgaaacac 

gcccgaccgg 

gccctccacc 

tatcgtcggg 

gaccaccagc 

acecaccccg 

tgcggt" cca 

gggg-gaccc 

ccgc-gcagg 

ggcaCgcccg 

cccaccggac 

ggggagggcg 

gcccacg-ctg 

aagccgaccc 

ccgagcaaag 

ngacaagcnt 

cacccgcagc 

ggcncacagc 

acctcecccg 

tgccggcact 

gcagcctitcg 

aguccccggc 

cgccacccgc 

cccggacagg 

CggtcaacCc 

tgggcacticg 

gcgcttccga 

ctgcacaggc 

cgagactaat 

tctaccccct 



cgcgctgcct: 

acgccccacc 

ctgaccctcg 

accgacagcc 

atccgcgcca 

acctcggticg 

tccgccgggg 

accagtcggg 

accagccggg 

gcc-gg-gga 

aaaCtgcacc 

atgatgcacg 

gc~gc z.cgzc 

ggag~accac 

gcagcaagcc 

gaagcccgcc 

aacacccctg 

cggcgg-tgg 

ggctcaacgg 

gtgcggaccc 

gtzggncgcgg 

C-cgaggcag 

cctcggagtg 

gcagcgcaac 

gcaggcgcca 

cgaatcggct 

gatgaaagac 

ccggcagagg 

Ggctcgg-cc 

cgaac cangt 

ttcaccctga 

tcccctaaaa 

ccagagccaa 



tccgazgacc 

cgcaaccgac 

Cugccggcc- 

ccaggacgac 

attcggg-gc 

cgagtgtgcg 

aaagcgccaa 

acnactcccg 

t^gcagtcga 

gacgccgctg 

cagctcccga 

gcccgccctt 

cgccgeatca 

cggcagcccc 

ggaccggacg 

ggcagcgaac 

aagggcgggc 

tggataacgt 

tcgtcagccg 

cgccgacac^ 

cactacgnaa 

gcaggcacgg 

tctccgatgc 

cagtcgticcc 

ggcctcgccc 

tcggtagtgg 

ggcgtGCacg 

cccaggccac 

cggagtctgc 

ggcgacaact 

tcgaatcac" 

acctttgaaa 

aaaaaaaaaa 



60 
120 
130 
240 
300 
350 
420 
430 
540 
600 
6S0 
720 
730 
a40 
900 
960 
1G20 
1030 
1140 
12QQ 
1260 
1320 
1330 
1440 
130 0 
1SS0 
1620 
1630 
1740 
1300 
13 60 
1920 
1930 
1937 



<21G> 43 

<211> 2113 

<212> DNA 

<213> Homo sapiens 



<40G> 43 

ggggaatatg 

agatggag-c 

aagaggcata 

Cgttacaagu 

CGGagg~ggG 

gcgtttcaac 

aatacagcaa 

Ccttacagct 

tgcGaatcca 



gatggc^c"G 
tcactccggc 
ctccccggac 
ttaaagaaca 
acacatgcca 
aitttctata 
aacctacccc 
aaatacaacc 
accaaacGCG 



tCCu"cgccG 
ccgggcaaga 
taaaacggaa 
ggtc;agcca 
gng-ggcgcc 
tactaggcag 
tcacacgcat 
aagacacaga 
ttaccatgcc 



CG-ccgttac 
agagcaaaa" 
cgccc-cccc 
gcgcgactau 
cagacgccca 
tctcccttaa 
gaccccacta 
acacttccat 
caacccccca 



-tec- ttget 
tcngtciggg 
aaactgztaa 
tgcag^gcaa 
actgaagega 
gne-eggt" 
acaCuaacaa 
cactacagga 
cc.cc.ca.cccz. 



atcgt&gccc 
gggcaggggg 
aac-agg-ag 
Caaaaaggac 
aggacaacac 
atcgngatac 
acctaaacag 
agctg-ectct 
gagaaccact 



60 
12 0 
130 
240 
300 
360 
420 
430 
540 



3^ 



gcfcctgctt:- tctgaicata tagtctggcc ttzccaaawa tgccacacag tcggaaticac 60 0 

atagtaigca tgaaccatgt agccgaagcc tccac~ccac "caggraaac acccaggaac ' 650 

aggag-gc-g ggttacaccc taagcgc^aa actccacaag aagccgctaa actgt-tccc 720 

aaagcacccg taccacttcc ctcccccacc agcaacaaag caccca~cag tctacgcac- 730 

caccag-gct agtgcggcca gaacgcacag tctcaactiac acacainaca atagacgnag 340 

agcgg~atcc catcgcgat- ctgcctitccc ctaacgaata tctcccctca cataCtcgtc 900 

attagtigzgt c~"ctccagt aaatggnccc att:g~T:tt:gc ccact-ccaa aagccgag" 960 

tccatatcglc ccaactccga gagctcctca tatactcugg acacaagccc tctgtcagac 1020 

acgngacctg caaatatttc cccccagagc ttttcctgcc ttgccagtcc cncaacagtg 1080 

CCtttaggaa ggagaacagc CgcCttCaac tctgacaaag cctgcaattg ctuCccccct 1140 

atggaccatg cttttggtgt agcatctgag aacncccc=t taaaccagcc acacaCgtct 120Q 

tctgttctc- ttgaagctct acaggttgag gt:acacc_tg g^ccacgacc acttttgagc 12SQ 

Caccttttac atataggtag Cataagatac gg-tcgaggt: tcccgcccgg ggtcet"cc 1320 

cttcctgcac atgaatgttc agttgtccca gcaccatctg ttgaaaaaac acratcc-tt 13 30 

ctctagcaa C etgctccccc actnccgtca gtcgactatt cg"gggtctg tttttaaacc 1440 

ccatctcaca ttaigtccac attaegaact tcacagncag ic-Cgaaatc mggtaaCg-cg 1500 

ggtczzcica. ccegtgctct ecaaaactg- cccggcticci: c;aaccccct cactcct-cc 1SS0 

atataaacct tagaaamagc ttaCGgaitc ccaccacttt cccccaaaaa gccacctggg 1S2Q 

aatt-gacca agtitiacacc gaa:ccaaca aacgactctg gagagaagtg gcacc~cagc 1680 

aacacagcct: c::ctgacaa cacccccng- cttagccctc tccctcaccc. aacaacccCc 1740 

caacactaaa accccggaca aacccagacg cgcacctgaa taaccccagt ttcgrcgcca 13GQ 

ttcaaaatgg uacataaccc ccacC-Cata gatgagaaaa tcaagtaacc tgcccaagcc 1360 

acacagctac caaatgacaa agccagattg aaatccacga gcttacaaac taacg-cctg 1920 

ttttgagcta cagtacCagt cccatcaaac Cgtngtggtt: aaactgaggc gggaggatag 198Q 

tctgagcacg gaagacngag gctgcagtga tceccgaCcc tgccaccg-a ctccagcctg 2040 

geaacagagc gagaccctgc ctccggaaaa aaaaaaaaaa caeca cggaa gcaagcaaaa 2100 

aaaaaaaaaa aag 2 113 



<210> 49 
<2il> 24S5 
<212> DNA 

<213> Homo sapiens 



<400> 49 

agaaaatgaa agcaccaaag agccctcttt: gccacaaCat cCCCgtgCgc agtcccccgc SO 

aggactaaa; ggtCicaatg tactizCtaCc tggcag-caa acccccccta ctgtgggccc 120 

gticctcagg- cagctgccgc ctttcagtgt ccctcgcatg gccttaccac ctccacctcc 130 

gggccctictc cctgttccct aitctcccgc aatgeeggge ccggttcccc ctactctcgg 240 

tgctcSccca aacacaggac ctgegaatet cagcctgccc ggcctcggac caacagccca 3 00 

gctccccgcc ggccccacag ccgcggctaa tccaaagtcg tccacactcc ccrccgcaga 3 SO 

ccctcagctt cagagtcagc ccccaceaaa cccaagccca gtgacgccaa ggtcacacag 420 

tgtcgtccaa caaccegagc cccccgt^ca cgcgggacac ccagtctcag tagcaaaacc 430 

acaccagi:ca ccagtcccag cgacccccaa gagcatccaa cgcacacacc gtgagacgee 540 

ttccaagaca cccggcagcc tcggagaccc cgccccgaag agaagagaaa ggaacaatca 600 

cgaaacacca gctcggccca gaggagacca gaaaCcccca gcggcggcgc -gactaaccc S5Q 

gccgctctgc caggeggggg Cgggatcaaa cgccccgaga gCcccggacg tecgaggegg 720 

gacgcaaacc atcccgtcct gagcaeggg- ccttcocctc tccctcaccc acactcctgc 730 

taacttccca ccaccazcaa tcacccgact tccCgaaagt aatcaaccgt gcatctaata 340 

ccagtCagag ctccgacccc gcatggcgcc acagtgaaag cgccgactga czcacggccc 900 

tgattcaaga accgtcccac Cgctggaagt agacccgaat aggaCaccgg agccttgttt 960 

tcccaaaggg gggcgccg~c tagcacccaa ctaggg~aag caCizcc^aac atgtattzcc 1020 

acttgccccg agzaaatctg eggegagaga agc"^ccnt~ ccgcagzttia aaaaagecac 1030 

cgcttcccta ggcttcacca ggaagecanc ttcag-cgt:g aaccccacgg cgctazc-ac 1140 

tccgt^cctg aaatgggati: cagtgcaaaa agt"acaac cacagcctci aacacatcct 1200 

ctccagggta cgacgacntg aacgcctaca cc;ccactc~ ataac~cgcc ctgeacznat 1260 

t tcacaaccc agtaa;aacg eggacaaacg tauccacatg acaca:gtca agaccaaaat 1320 

aactgtgaat gacacacctt gctgtaaa-g aaccg^gctia accctgactg egggctcgag 13 80 

aacaaagatg aactccagaa ccccagcagc ccaaccgctg cttctcaaa: aactg-g-ga 1440 



35 



acagtgagac accactgttc gttzccaaaa acccxactgt gcccact;cc ccccac-aca 15QG 

tgcccngca* tctccacc^a aanacucagc cgtagcgcca tcagacacgg acgcccccta 1560 

acaactgctg ttcgcaaaac aaatcaggac ggragaaagc gatctcacgg aaaactggaa 1520 

cccggaCgag accccctcat tgaactctga agag-aacga tgcgaaaatc gatacagggc 1630 

aagagacgac fccxetegccc tccticcacc ccacgtccag aagagcaaga gggaaaacgg 1740 

acatacg^c cacacccaag ggtacccaaa ctgtagccag tcggcacctc egaaaaaCga 1SQ0 

gaacggtgag cgcacgggct; ggccgntcca gca~gaacac aaccc-ggaa ac~gc~aigc 1360 

aatttcccct tuucaaccca catcacttca ggggcgcacc aagccgccaa acnacaccag 1920 

Ctctc-g-a- tcctagaccc gctgacattu acccccctcc cgcctc~cca gagnaaaCgg 1980 

tzccczzctc tcctccctac tittccttcac zczczczZcc tcccc^cctc cctactccct 2040 

ZZcZZccZzc ccctccctcc cccaaaac"a tciacagacgc agaacccugg egtagggccc 21Q0 

tatCCcatct CCatcccctg acccaataaa acgt'atacg aaagaatgaa aataccaatc 21SQ 

caagagaccc igggagccng aataaagtag ctccacacca actacaggac aataccagcc 2220 

tcactacccc cacaagactt tcCaaaacct gaggtagg-a gccacactaa ataaatccgc 2230 

tacttataca aaaaccccca tcaacactaa accc~t:aaag ectacaagct: ccttccccccc 2340 

cctttacag" cccccacaga g~caggtcaa aaacgcggtc ctaaccacca acaactgca" 2400 

ggtiaaacga ccctgaacca aaaccgacgg gctccctacc aaaacaaata aaaaCa:acc 2460 

tiltttcaggt ttcaatccgt gcagggcata tgcatg"aa tcc-accatg ctcaagaac- 2520 

cccacaaaat actccacgga gaggtccgca tcnagacgga aacagaaa:c gczzztcccc 2530 

tcaccgtucc tgaacgcccc acactiigntt caacaccctc gccaictctc cccascactc 2 £40 

tgaccacgac acgaccactc aaccncagaa ttcacaaccc c-gaggggcg ccaagaagca 2700 

gtcccactgg tgaggatatc acgacccggc gaccatcctt: aggag-cagaa aaccaaggac 2760 

aattgcttc- gcactcagca tccacntcct. aacg-cggccc cacacgcaaa aacaacaacg 2320 

cag-ggccgt ccctgtcagg aaaacaaatc "cacagaaca ac-ggcggaa ctgaagccgc 2330 

tgcgctagac tcggacacte cgggtiagtga agaagcaatg gcaacctcga gTictaccacc 2940 

gr:ataattca gtaaaagaaa aaaaCaaccg ttggcgg^cc cactaagaga acgcagcccc 3 000 

tct:gag-~gr cacagaggct gtigcgrgccc tacaccgacc agggtctgra aaaccctttc 3060 

attcCggtac aagagtcggg ggtaeaactc ctaCacccga aiactacctiac caag-tcaca 3120 

tttcccaacc ccttectgt:a aggtcctacc CccgcattCa aacaactttc ctccaacgca 3130 

aagccgcttt ccgctcaccc tattgcaceg ctagttigcat gcaggtacta aicccaccgc 3 240 

cgcctacrgc ctttgCCCtc utatcaccca gctccgcccc tcctcctaac ggctatatca 3300 

tctatagcta tctacccgta actgcaccac acgtaaactg atcctctgtc ccgattcctc 3360 

tictaacact tttaggaaaa tannaagc" tacaaaatag caacaaaaaa caactcact; 3 420 

aaaaaaaaaa aaaaaaaact cgcagggggg gcccgtaccc aaCtc 3 465 



<21Q> 50 

<211> 1237 

<212> DNA 

<213> Homo sapians 



<4QQ> 50 

atgggccagg aatggggtcc ccgggcaegg tgctgggcct cccggtagcag atctgggccc 60 

tgcaagaagc ctcaagcccg agcgtgcagc agggccccaa c-cgctgcag gcgaggcagg 12Q 

gcagtcaggc gaccctggcc cgccaggngg accaggccac agcccgggaa cggc~ccg~g 130 

tCaagtggac aaaggacggg gccatcccgt: gtcaaccgea caccaccaac ggcagcctca 240 

gcctgggggt ccgcgggccc cagggacggc tcccctggca ggcacccagc cacctcaccc 3 00 

tgcagccgga ccctgcgagc cccaaccaca gcggggcgca cgcgcgctgg gcggccgcag 3 60 

agaiccctga gntggaggag gctgagggca acaiaacaag gctctticgcg gacccagacg 420 

accccacaca gaacagaaac cggatcgcaa gccccccagg atcccccccc gtgccgctgg 430 

gggtgggaag cacgggtgcg gctgcgaccg tgcggggtgc c^ggrcccgg ggccgccgca 5 40 

gctgccagca aagggactca ggcaacagcc caggaaacgc actczacagc aacgtcctac SQQ 

accggccccg gggggcccca aagaagagtg aggactigc-c cggagagggg aaggaccaga SGQ 

ggggccagag cacccattca acczcctccc cgcaaccggc cccccgccag ccgcacotgg 720 

cgtcaagacc ccgccccagc ccgagacccc gccccagccc caggcccggc caccccgtcc 730 

ctatggccag ggzczzzcoz agaccaagcc ccacccagca gccgaggcca aaagggttcc 840 

ccaaagcggg agaggagiaga gagatcccag gagacctcaa caggacccca cccataggca 900 

cacacaaaaa aggggggatc gaggccagac acgg-ggccc acgcccgtaa tcccagcag- 9 60 

ttgggaagcc gaggcgggcg gaacactcga gg-caggggc txgagaccag cccggcccga 1020 
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acccgggagg cggaggctgc agtgagccga gatugcgcca ctgcacrcca gcc^gggcga 1080 

cagagcgaga ctccgcctca aaaaaaacaa aaaccaggag gacngggagc ctgccagcco 1140 

catcctgaga ccccg~ccnc atc-ccgtiaa cgacggacct: cgcccccacc cccccccaag 1200 

aacctaacaa aggcccgrga agaaaaaaaa aaaaaaa 123 7 



<210> 51 
<211> 1397 
<212> DMA. 
<213> Homo sapiens 

<22Q> 

<221> SITE 
<222> (13 33) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1395) 

<223> n eq>ials a,t,g, or c 
<4Q0> 51 

ggtccctagg agtcgagcag gaacaggcac ctgtiggtcta cggcgacctg gczctccgcr 
ggccacgtgg grggtgaggg cacacgag-g ggaagcggca ccgacgtgtc tccccccgac 
cgtggc-ncg ccaaagactt ctaacagcac tttctaagtg caaaacgtcc agg-aaaaac 
ctttaccacc agtgaccaaa tcagaaigca tttaacatag taggcggctc aagaac-gcc 
ttaacg-caag acaaaccgat agcaacatcc tgtcgtttta aaggaagtgg g-ccg-gaca 
ttctzgcagcc agtccaccac tccaaggcaa ctaccgacct ggttccagtg aacctacttc 
gtttccaact acagtgaccc attagcccag tacc-agaaa ctacgcatac ttcg-gctac 
tgtcaccgat gtg-aaaccc tgtccteatc tgtatttatg cact-gg-tc cca--Cggag 
cctctgccct ttcctgggat aagcggtg-c tgccgagaca Cctcccgg-t g-ccagtggcc 
aggagcagcc gagctctagt ctgccagctg ccctgctocc tctgggaagg aggCggcgcc 
cgcccctcag ggtgtctzcca gggctcagct tccggggtgg tagagccggg gagccccagg 
' ggtgggggga cagccgggag atggaggtgg cacccgctcc cccagaccag taccggcuct 
gaggacaggc gagcagtggg aagaccaaag aatggccggc agcgctgcca rggccggaaa 
tgggggcaag atcctggggc tgtgtgccct: ggggccnccc tcaccCgtct cggtggccaC 
ggcc-caggg atggctccta ggcggctgag gcacagcag- ggctggaagg tgccccgtgg 
aggctgaggc ggaggcgcgc ccagcagctc ccccc-gtgg ccatggcggg cacgggscg^ 
aggagcuggc tggcggccgg ccctgcacgt tcetgtegcc tgtcgtccgt aactctagcg 1020 
t'ccgacatcc gccg-gatac agtggtgcca cgacgtgugt aacxgtggtc agcagacccc 1030 
gttccgcgtg gacgcctcaa gtgga"aat cccnggaagc cccaaictg- acgtttgagt 1140 
atctacatga gaacgttaCt tgaacggaat tt-cttaacc cagaaggtag taCtnatiaat 12QQ 
cattcacttg tagcgaactg cccaaagtia acacc-gucc aaat;tcttc acactacagc - 12SQ 
a:itacgcaa tggct:acag aactcatgga gttatcEC:a ccag-atggg aat:aattaa 1320 
aaccttgaat cccaaaaaaa aaaaaaaaag ggcggccgcc ctagaggatc caagcttacg 13 3 0 
tangcg-gca tgcgana 



<210> 52 

<211> 2271 

<212> DMA 

<213> Homo sapiens 



SO 
120 
130 
240 
300 
3 SO 
420 
430 
540 
SQ0 
SS0 
720 
730 
340 
90 0 
9S0 



<4QQ> 52 

cggcacgagc ggcacgagca coggagggcc ttggaccatg aggaggaggc cccgtcaccg 
ggcagtgngc aagaggcaga agccatgtca gacgagcccc aggaacaagc ggagggctcc 120 
ctgaccgtgt: acgcgacacc tgaacactcc ccacccctcc cccaggacac gacgagccac I3Q 
attgggccca agaggacagc agtggtgcgg gggataa-gc accgggaggc ccctaaca:c 
attggccgcc gcatagccca ggtggcccag gccacgcc^c -gaccgagga tgcgcctgc- 



£0 



24Q 
300 
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cctgccctgg ctgaccacc~ tccacaggac aag-ggagcg ccgagaagag gcggcctctc 3 SO 

aagcccagcc tgggcta-ga gaccaccc-c agnccaccca acccagaccc caagccccac 42Q 

gatgnctact gggaca£~ga gggggccgnc cggcgc~atg cgcaacccct cc-gaacgcc 43 0 

ctcggtgccg ctggcaaccc ccctgcggac ccccagactc tctactacgc aacgccgggg 540 

gtgaatcccc gcttcgaccc agcttcctcc agctaccact tggacacgca cagcctcccc 600 

catgccacca acccagtgga gtcccggccg ggacccagtg ctgccccctc g-acccCgcg 650 

ctcaacttcc taccctacgt. gcctgagccc gcacacccac cgcng-acac tcaggacaag 723 

gatggcgctc cagcggccac caatgccttc catagtcccc gccgggg-gg catcacggca 730 

tataacgccg actccaaaac ctacaacgcc tcagngccgc cagtgagagt: cgaggnggac 840 

acgg-gcgag tgatggaggc g~cctggca cag-tgcggc tgctctccgg gatcgcccag 900 

ccccagctgc ctccaaaacg cctgctctca gggcctacga gcgaagggcc aacgacccgg 9SQ 

gagcuagacc ggctgcCccg ggctcggcca gtggagaacc cggccacagc caccaccacc L020 

cc-acctccc tggcgcagcc ucccggcaag a-cagcaaca tcgtcatuaa ggacgacgcg 1030 

gcatctgagg tgtacaaggc tgeagctgcc g-ccagaagt cggcagaaga gttggcgtct 1140 

gggcacc-gg cacctgcctt tgtcgccagc caggaagctg tgacaccctc cgagcntgcc 1200 

tcccttgacc cgtcactccc ccaccrcctc catttccctg atgaccagaa gt:t£gccatc 12S0 

tacaccccac ccu;cccgcc tatggctgtg cccatccccc cgtccctgg- caagatcctc 1320 

ccggagaccc gcaagtcccg gagaaagccc gagaagacag actgagcagg gcagcacccc 13 30 

cataggaagc ccccccccct ggccaaggtg ggcggtgtca gaccgngagg cacgtacatg 1440 

gggcctgccg gaacgacn-a aacacccgtc cccagtctcc accgccggcc ctccagcaac 130 0 

caaagcacaa caccccaaga tgggttcatc tcttctcccc tccccatcca ccnggctcaa 13S0 

tcctcczcca. ccaccagggg ccccaaaagg cacatcaccc gggcctcctt accucgttcg 1620 

acaaggctgc tgcctgtctc ccectgtggc aaggactgc- tgc ccccztig ccccatctcc 1630 

caacatagca cacttgtgca ctgagaggag ggagcaccac gggaaag-cc ctgccttcca 1740 

cacctccctc cagcccctgc gggacagccc cagcccccgc tgtcacgaag gggccaggca 18QQ 

tcggtcacce gcgggacctc ctcccccacc cccctccccc ctagctggct ttgnctgtca 13SQ 

ggtgcagt.cc ggcgggagtc caggaggcag cagc-cagga catggtgccg cgtg-gcgtg 1920 

tgtgtgtgtg tg-gtgng-g tgcg-g-gtc agagg"cca gaaagcccca gac-;ggaa; 1980 

caaacagtcc tgaatccaaa tcctcgtccc tgcacccatt gtctggagag ctitggataa 2040 

ggtatcgaac ccccccgagc cccagtctcc catccgccca aatggcactg acgatgtctc 2100 

ccttacaaga tggttgcgag gagtaaatgt gatcagcacg taaagtgtct ggcgtgcagt. 21S0 

aggctcttaa taaacactgg ccgaatatga attggaatga tacaaaaaaa aaaaaaaaaa 2220 

acccgagggg gggcccggta cccaattcgc cccatagtga gtcgtatcac a 2271 



<21Q> S3 

<211> 2769 

<212> DNA 

<213> Homo sapiens 



<400> 53 

ccgccaccgc cgcccccccc ctcccgccgc cctcgggccg ggccgggtcg agccgcgacg 60 

ccctcggact tcacctcac: gcccagcgcg gacctagacc cggaatcgcc caagcccccc 12 0 

CacCcgcgag accccccgaa gtcacaccca ccacagaacc tccacagagc tggactcaccg 180 

• gaaggtcagc aaagtaccat Ctcaaagcac atcgtgtcag tacgcaaagg ccctccctga 240 

aaCggagtct aaatgggaaa gcacgaaacc tcgcaaaccc acacaacgta gcccCccccc 300 

aaagagtctg atccttotcc cagagcagct gaatgtccca acgggccctg ctgctgcgct 3 SO 

tgacgCgccc gtcggccgcc ccacctgccc cgagaaacat cgaaaaaaaa aCcacctCaa 420 

attaCgtgcc gagcgtccca cetcagaaCC ccagaccctc accccagccc tctacacaca 430 

aaacagaatg tgaaaaggcc caaacaccaa catgaatata aaagagcgac accaaacaca 540 

Caatcagccc gaaaatccag aatccagaac atccataegc cgagcacctc aaaa ca eg z z SOQ 

aactattcat cgcaaaagca tcgacgccat Cgctactgca ccacactcaa gggcaaggaa SSQ 

agccaagtaa cagcgacgac gactgagcta tgcttagaag aaaggggccn agaaagccct 720 

ggaaatgcta aciagagtccg tagaccagct Caaacagctc acacccact: tagaaaacct 73 0 

tcggctncgt g-gtgtgtgt: g-ct-aacac Cctaagacac aaaatagtct zqzgzzzzzg 340 

cagattctga agtgacccca cacacatcac ctccaaacc- ccacaacc.c ggaaccggac 900 

agcaaaggca c-aacttccc caccgcagag aagagaaac" aaggctgaaa ccaagaaacc 960 

aaaaaaaaaa aaaagtccca cagnagaaga aataataccc tczgcgttcc agtcagtgcc 1020 

ctttacgcaa agcggccacc gcgaaacaca cacgtaacca gtaacaaaca ttccacggtc 1Q80 
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tactttaaaa aggccctaaa cattcaagtt tgact-gaga agatitttgc cnccctcgga 1140 

ggtagtccgg gacagugggg ggcaagcggg ttic-gtagnc gagcccaaac ccggatacca 1200 

actcagcccc ccaccagtcc cgngactccg ggcaagcgac ccnacccccc caaacnccaa 1260 

tattctcccc gzcaaaatgc agccgcccct caaatttagc gagctaacgti gca"aaatt: 13 20 

gcctagcata gcgtccagcG taacaaa-gt tagtttcccc c'tcagtatc agc-aaaccc 13 30 

ttttaacagt acaacttaga gcacacaaaa tgcccccaaa caaataaccg gccaaaCccg 1440 

ttttgtacgt gtaagaagcc catagatata cattag-catg aaaatcaaat: act"taaaa 1500 

ctacaacc^g aag~gagaaa t"~cagcag tgttccccag cgaagcagac cccGggctCC 15 SO 

taCtcada^C caacccgcga cca-gc-gta gagacatacc cg-ctg-aca aacccacc-c 1520 

atttcctgga caacaaacca cagag-gtca caaccaCaga gacgaccagg gaggagaaag 1530 

acatgaagaa aaaaacgagt g'taacuact Caagcaaaca Ctcccaguga gctgatggaa 1740 

atacagacaa ccacccagca cgcttatcta cacgtctacg cctaaatac- Ctcgneaacz 13 GO 

tCtcaC-CCa gtsiaagggga t-cccactca aag-aaggcz accctccdag aaazcataca 13S0 

taaaacacaa catgtacaag caccaattcc gtcatgccc; tcaaaatgtc Ctgagacagg L920 

gtgtcctaaa aaagttgtcc gctcctgtaa tgaagcg-cc aatctgcgcc acctcactgc 1930 

tatttccctt tcaccattga cattccaaag aacacacaaa agtagagaga aeaaaataag 2040 

ccccaEatac ccatcaccac agtt-aacca gcacc-aacc ttgcctcccc caccccccct 2100 

tctcicccaa gccggatcac tctaaaacaa atancaaaca ccagaccact ccacacacaa 2LS0 

gtacctctgt accacagcga cttcctta'a ttiaaccatcg acacaaccia accgctaaca 2220 

taattactgc accaaccitc aaecactccg cacagcccat aaaccatt-c ecggaacatt: 2230 

ceagttcatt tcagcaaagc ctcccgtgac tctccngccc tctc&tcaag acagagtgct 2340 

gcaggaggcc accgc- cc~a gCacatagta caccc-ataa cctacgccga aaccgtaggc 2400 

cacgctgcta gatcggagun catatcgteg c-ccgacacc tactcgtatg gccttagccc 24S0 

aalaatctaa Cctcittcgc ttcaatccec tcacccccaa acgggagcaa taatagtgcc 2520 

tcaccctagg tcctcgtgaa gactattcga ggcaaatccg tgcaaagcat tcaac~"agc 2530 

acacgctaag tgctcagcaa acgctagcta aagctaaaaa attuttaaaa cctaaacgcr 2640 

agctgtagag tgtCCactgt gCgccaagaa cctcacaaat aagaactcct gaggcgggca 2700 

tatcac^cgc ggtcaggagt: ccgagaccag cceggcctac acggtgaaac cccacc;cta 27S0 

ctaaaaaaa 2759 



<210> 54 

<211> 1339 

<212> DNA 

<2L3> Homo sapiens 



<400> 54 

ctgcaggatc cggcacgagg acctgttgac cacagaigca agaacctaic tccggagcc- 50 

caattccaac ccattgcect acatgcgcac cggtctcctg gggccgccac agcagaacac 120 

cacagac-tg ggggcttaaa caacaaagcg acccctcaca gctczggagg cCccaagccc 130 

gagatcaagg agtcggcatg cctggcccct cctgaggcct ccctccecgg cccgcagatg 240 

accacg~tcc caccgtg-tc: tcagctggcc cctctccgtg cacgcacatc cctggtgccc 3 00 

cttcccccct tcgttgacca caaggatacc agtcctg-ctg gaccagagcc ccactg-aaa 350 

ggcctcatct caacccaacc cacccaaa.a acttacctcc acacacagta agcccgaggc 420 

tgagtccttc gaaccccggg ggagacaccg cagcccacca cgcatgccga cgtgtgccac 4S0 

cgccacactg cccggttacc gcagagtg-c agnaagcttt gaaa"~agaa agta-aagtg 540 

ctccagttct gtcatttctc aggattaCCa gggctac^c- gggtcccc-c cacccccata 500 

tgaacttgag aaccagctcg t'taCtcccg ctaaaaccta gt:t:gggat:t:ii teetcttC" 660 

gagatggagc ctggctgt:gt caccaggctg g-gcgcag^g gtgcaatccc gcctcaccgc 720 

aacctccacc Ccccgggtcc aagtgac.c; cccgccccag ccccc-gag- agctgggact 73 0 

acatgcgcac gccaccacac ccagctaatt rHZT.qza.zZ-z tagcagagac ggggnctcac 340 

cacgtcggcc aggatggccc tgacccctcg acc~cg-gac ccgcccacct cggcctccca 900 

aagcgctggg actacaggca tgagccaccg cgcccggcc" cagtcgggat tutcacaggg 960 

ctcgtttcga gtctgcaga" caacccgggt agtacngaca ccctaaaacc aagtctaccg 10 20 

atccacaiaa gtgggacgci agctaatacg gacaag-gaa ettactacnc ccccaaaaaa 1030 

ctactttcac aaaccc*a~a ccaacacgag aaacca^gcc gcatcctiect acc-ggaggg 1140 

agcaagaaag tcaggtiggcg cgcc^zccgc acacgggcaa cccaatctcc taictagact 1200 

Ctctttcgc- cgcct:ggaaa aaaaaaaaaa Cgaggcatcg caaaaacg.g acaattgccc 1250 

Gccctag-ac gaaaaaczgc taaaacgagt aaagcc-gcg ctcaaccaga attctcacat: 1320 
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tcccgtatag atgaacccgc accacaccnc gagccgaaca cgacaccaag ctcaccgaca 13 30 

ccgacgacc i3 3 9 



<21Q> 55 

<211> 743 

<2I2> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (15) 

<223> a equals a,t,g, or c 
<22Q> 

<221> SITE 

<222> (17) 

<223> a acuals a, eg, or c 

O <22G> 

M <221> SITE 

If! <222> (29) 

42 <223> a equals a, eg, or c 

5 <22G> 

S <221> SITE 

m <222> (32) 

<223> n equals a, eg, or c 

f=* <220> 

111 <221> SITE 

IJl <222> (64a) 

=4 <223> a equals a, eg, or c 

111 <220> 

<221> SITE 
<222> (741) 

<223> a equals a, eg. or c 



<400> 55 

ggtgcggccg t-t:ananOag ggacccccng gatgeaggaa tieggcaega gcctantgka SO 

tgagcamcag cgacaggcat ctgggactct tccaatc-ga agcagaaagg aacaccacca 12 0 

ccatgaata: ecaagaacga gtatecegge tcagggg-gg agtagcaggg teaeggggea 130 

ggcacacctz cactttcact tcagtaaaaa atgtcyagkg ggccagccac -gegecagge 240 

ccaracgaca tcaccacaag gagacacata cagagagaac cagacaggat ccaaccagga 3QC 

gacaaaccac acagaaact- aaacagegat tgcccaatac acagaatcgt taactacgac 3 SO 

aggggai:c~g agiaagagga actggttacc aagaaacaaa gagaaegcaa acgaaigcag 42 0 

aaatagaccg ggcacaatgg ctcacacccg caatcccagc acactgggag gcaaggcagg 43 0 

tggaccacga ggecaagaga tcgagaccac ccaggccaac aeggagaaac cccgacccaa 540 

otacaaaaag cagccgggcg tggaggegag cgecagnaga ccoagctaca caaggaggee S00 

gaggcaggag aatagaccga acccaggagg cagaggatgc ggaganccga gaccccgcca 650 

ctgcaczcca gcctgggtga cagagagaga ccccaccaca aaaaaaaaaa aaaaaaaaaa 72Q 

acacgagggg gggcccggaa accaaccc 743 



<210> 56 
<211> 4202 
<212> DNA 

<213> Komo sapiens 
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<220> 

<221> SITE 
<222> (57) 

<223> n equals a,t,g, or c 
<22Q> 

<22L> SUITS 
<222> (53) 

<223> r. equals a, eg, or c 
<22Q> 

<22L> SITS 
<222> (€1) 

<223> n equals a,t,g, or c 



<22G> 

<221> SITS 
<222> (4137) 

<223> n equals a,t,g, o: 



<4QQ> 56 
accagctcaa 
ntgccctccg 
ct-gaaggga 
ctgacggggc 
atcccgcagt 
gtgagggctg 
tccagtcttg 
• cagagctgct: 
aatctcctta 
tcctgttgcg 
taaggaacga 
ccccccctga 
tttgaaceta 
atgcatatat 
aacctccaaa 
acagctgcct 
tgcctgtcag 
ttcgccctct 
tgggaggaat 
cctcaccagc 
cacagtcagg 
' cacacccacg 
tctctacctt 
acaggc-gag 
gaagacagac 
gagaagaagg 
ggaaagggag 
agccttcccc 
gcccagcagg 
tctcgaactt 
ccttgcgcca 
ctgcgcacct 
tggcccaag- 

tcatccccca 
ctccccatct 
tcutgg-cact 



agaCtcacga 

cgtctgtgca 

aggcccg~ca 

tccaacagcc 

gaactgaaac 

aacccaccce 

C-cc-acgaa 

gggCccct.ca 

gctcattcct 

tcccaagacc 

accgggggcc 

aagCgggttc 

tatacctcat 

actccataau 

aatggtactt 

gctgctaggc 

tcgacccacc 

tcccaaacgg 

tgctctgcca 

agaccaggag 

aaggaaacac 

cccctcgtca 

cgcacacccg 

ggr.cc a tggg 

tcccaaaatg 

cacaggtggg 

ccgctgcgcc 

cctgggcttc 

ggctacctca 

ccttcctaac 

acctggaaag 

actacctcga 

aggggacccc 

agagtcucca 

aaaggttaaC 

tcgcacattt 

gcaccgccgc 



g-cccaccga 
Cgcgcgcgtg 
tgtccccgca 
aggctgtggt 
cccaatgaag 
tctacatctc 
ggctccaacc 

aacggggccg 

ccatctcctc 

agggaggaga 

tctgaactac 

g-ctcggggg 

atctggaagg 

agacgggaac 

ccccagccat 

tgcctgccac 

tcacaggga" 

-ccctgtccc 

accggcccca 

ccgaactaaa 

ggtcaccacg 

tcaagtatct 

aagaaagaca 

gaattaggca 

gaagggggtg 

tg-cggcaaca 

tgaccgtgtc 

cccccgtgct 

acrgtgcccg 

gtgcagccra 

cacagagtgc 

atgccttccc 

gggacgacaa 

tgcgtacttg 

cggccttctc 

g-gcacgaga 



grcactgtga 

cgtgtgggcg 

ccctgccccg 

cccczgacgt, 

acagaactg- 

Ctttcaagac 

cgtttccacg 

ccattacaga 

ggatcacaa" 

cccaacactc 

acagggggga 

gtigrccgggg 

cctgacgtat 

cctccgatgc 

Cggaggccaa 

tttgacactg 

ccagrcccgi: 

gt-gatcagc 

tgtgcctcci: 

aggccactgc 

tggaagagac 

tacagagact, 

gctgagccic 

caacgaggca 

ctggggagag 

gccaggagca 

ccacctggag 

cgcgccccct, 

gtccccgcgc 

gcaaggcggg 

gactccagrg 

tcccccacta 

tctcaeggtc 

actggaccgg 

tggctgcgcg 

tgtggt-tta 

aaaccggggg 



gcggagccca 

tgtgtigcauu 

caagaigcca 

tccucacccg 

gcccggggag 

cggaCcagag 

caaacccgcc 

tcgactcaca 

acgg-ctgg- 

c taccgcctt, 

Ccaagaaggg 

gaagr, tcctc 

atacacatat 

Cagaaaaacg 

ccttccacaa 

gggtggacag 

ggacgcgctg 

tccaccagag 

g-cggcaggc 

accacagggc 

acccccgcgg 

actcggagag 

tcggccgcaa 

gtaggaggtig 

accagccccc 

gaaggaagaa 

aggccgac~~ 

cicttgcccc 

ggccctcrgg 

gagcattcct 

agaaccccgc 

gaagccccgc 

Cnagcaccag 

agacaaacct 

tgcctccgtg 

cacttggtca 

aacgctcacc 



tgctggrmcg 

gctgggccag 

aaccccagcc 

tcgccaaccc 

acgcaacgag 

cccatcrcca 

aaccagagcc 

gcagggagag 

ccaa:ctgca 

tggctcccag 

aaacccaacC 

cggacaccaa 

acatacacac 

gaaacaagag 

agccagcccc 

tcgactcacc 

gtgccgagcc 

ggagccctga 

agccat-g-a 

aacttcctgc 

tgtttaatac 

cctcacgccg 

ggacgcagaa 

gggaagaaaa 

ccacaccagg 

gactcaagac 

cataccccca 

ctctgcagac 

gcagcaggga 

ctgcccctcg 

cagccgagcc 

tcxgctctcc 

cagcctag-c 

cgtcagacgc 

tccccactcc 

aaagcacccg 

ggcacaagaa 



60 
120 
130 
240 
300 
360 
420 
430 
540 
S0Q 
660 
720 
730 
340 
900 
960 
1020 
1080 
114Q 
1200 
1260 
1320 
1330 
144Q 
1SQQ 
156Q 
1520 
1630 
1740 
1300 
1360 
LS20 
1930 
204G 
2100 
21SQ 
2220 
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aggacccc-g accccttocc C-ctccgtgg tccccggcat tagatcgggg gccc-gggag 2230 

aggcagg-ga acgecccaag cgaaccgctc tg-ctgcaac cggaacgctc t-gaagncct 2340 

tggtgtrgct ccgtgaaagg acatcgccac ccggtgctca tgaggtg-cc ecgcagaaca 2 400 

ataaaeggca aatgaacaac camaaaac-g tcacycttgt tggccteccg ccgttt:gtag 2430 

attagtgcac ctacctgcga gggattcggg tcacctccct gagcceg-aa gcaaccacaa 2520 

gccctgccac cgggtiggggg aagcccctcc ccaaccacnc aaaaacaaat tctcccacac 2530 

attaccccac cccacacact tgaccccggc cagacucng- tcgcctaaag gaacagacca 2540 

catcgctggg aaaatgagta agngaacgcg Cgggagaaaa acactnutag aaccacgaac 2TQQ 

dttcactttc aaaggtccct ccgccrggcc gcaaca^ag- gtgcgtccaa attzacccaca 2 760 

ggc~gttgt:t tctcaaataa acgttcaatia tcaaccatcc ccaaaccgac aagaacacaa 232Q 

aaaeaaaacg caaatacaga gccagc~ctg ccacccaaac ccgizgtccac ttcngatagt. 2330 

ccatggaatg tggctctctt ggaagccagg gtcgg-ctcc ccacagaccc caggctaagg • 2 940 

tcaccagt-a ggaacccagg acccggaagg cagagctgtg agccciiucca tcagggacct 3QQ0 

gactccgcaa aacgacctga cgaacgcaac tggcaaactc ccatg-tcgg acctcaca^g 3060 

catgagccgc tggacagagg gcttcntag- atatacceta acgcatgttt acgcgcaacc 312 0 

ttgttagrgg gzacacaagt tcgtgaagaa ctcctcatut: caacaggcag tcaangtaat: 3130 

gcaccaaaag cctgggaatc cggggctaca ccotccccct ctgactcaac aacc"caaa 3 240 

gaatc'catag gaaagtcagc acccgcagac aag^ggtcag cctggccaaa a;g"acaaaa 3 300 

cacccagaac ccacaaaaca cccagaggrc caggagaatg Ccczaacgct caagaggcag 3 3 SO 

gaCcaagtga agaggttaca gaaatcagtg cctctggccg ggcagtcaag agagcgggcc 3 420 

caaat'tctgt. gacccacticc tccgtgtctc ggttggaaa- gaacgggtaC cccgcE-cccc 3 430 

accctcccac acgccgtgai accccaaac- ccecggg-ga agggcctctc ctcagcccaa 3S4Q 

gatcccgatc gcgaacaeca acaaagagaa cagtcatccn ccacagaaga taacccacta 2 600 

atgacactcg actcagcgaa taaatacaixc acttaaaaaa ataCtgtagg gggaccacga 3 660 

aagOagcgga gg~aaccaca accaggagag attgg-atca aaactgagca aagtcccaac 3 720 

tctcaccaga cgacaatcac gcaccccgc- agacgcccca gggcCg-cag ' cctggaactg 3780 

aaataaatgt gtcaiaagcg gcgctggacg cctccctcag cccacc-gaa aacatzggate 3 340 

tgatcatgtc agccccctCC ctgctggaaa aaaaagcagc ttgca^aacc gg"gttaacc 3 900 

acCccg-ctC gattccacag agcaagtaac acccaaatgc ggticccaccc caacctcc^g 3 960 

cctttgttiac ccccagaacc acgcagacac cgaaatgtgg cgcgcgtgtrg tgtg-gtg-g 4020 

tgtgtgtgcg tgcgcg-ccca tataacagga ggacaggaaa ggtaaggacc cagaacaacg 40 30 

aagacctat: gaaatgnggg gggggtg-gt gcgtgtgcgc g-tcatagaa caggagnaca 4140 

ggaaaggtaa tgaccaccca tggaaga~ga aggggtagra c-cagccgga cgcgcgggcc 4200 

ga 4202 

<210> S7 

<211> 354 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITH 
<222> (331) 

<223> n. equals a,t,g, or c 
<22Q> 

<221> SITE 
<222> (339) 

<223> a equals a, eg, or c 
<22Q> 

<221> SITS 
<222> (844) 

<223> n equals a,t,g, or c 



<22Q> 
<221> 
<222> 



SITE 

(351) 
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<223> n equals a,t,g, or c 



<40Q> 57 

cgaaaccacc cctcaccaaa gccgaacaaa agctggagct cgcgcgcctg cagg~cgaca SO 

ctagtggacc caaagaatcc ggcacgagcc ggaccccggc aaaatgcaga ccccccaaaa 120 

Ctcttguaaa tacagatgac acccacgaac agc-nccacnc aatcgttcat aaggtaacaa 130 

ctgttcccci: cccagcgtca acgaccgngc cgaagcggat cttcct-gca cgtgracacg 240 

agtgcacgcg caaacccctc aaacgccccc tggaaaagac accggaagtc ctgacicacgg 3 00 

taaaactcaa aacgggcgcc ccCccagcgc aacaag—ca tttccagcnc ctcccaaaca 3 SO 

gutctgtraC tagcgaaaga ggaaccgtcc aagactg~ga tctataaacg cgcgaag-cc 420 

cacg-gc-cn caaaccaaag gctg-cagag gttgg-gccg cccgtcctcg aaacggctcc 43 0 

ggacgggggc cgtagccacg tgtccgtgca Cacgctgccc ttgccc~gac eccaaagcng 540 

taggcttgcc aacnccataa ggaticgtact: ttgkttccgt caggacatgg cccegcaagg SQO 

atatgcactc aggtcgtgtc cccaactaaa ggcagcc- tt gattcaagaa agaagacgga 650 

gcacgcgcac gtgtccctcc ccccccctgc ctgaggc-gc ggagaagctc ccatccacaa 720 

aggc.tcagaa acgacgccgt. gggggamcag gaaggagcgg agaactagcc ccgagagcac 730 

ccccagagcg gccgcgggcc caccgacttc ccacccggge. ggggtaccag acaagcgtng 340 

aagrtaccccc ncza 354 



<210> 53 

<21i> 1455 

<212> DNA 

<213> Homo sapians 



<22Q> 

<221> SITS 
<222> (5) 

<223> n equals a,c,g, or c 



<22Q> 

<221> SITE 
<222> (39) 

<22 3> n equals a,t,g, or c 



<220> 
<221> SITS 
<222> (40) 

<22 3> n equals a,C,g, or c 



<4G0> S3 

ggcanagccg ggatggggtc agcacccaga agccagccnn cCctgacagc cccctctccg SO 

gccaagcsot gcctctgtac agccizcgagt ggacagccag agggcacgag ggagcccaga 120 

gcccaagatg gagccccagc tggggccnga ggccgccgcc ctccgccctg gccggccggc ISO 

cccgccgctg tgggccccag cccegagccg ttc~zzczcc tcgccagctt cteccctcic 240 

ttctccggtg ccccaagcca gaaccagcca caa-ctcgga aggacttiucc ccggtcccga 3 00 

taaaegcaat gcccgcaccg ggacacccac tcgcaagaag tcccttiaaag aagaaa~aag 3 60 

aCctgacaac tggctggc-u cocaccctigg actgcccccc gaccccctgc cctcctarcc 420 

tgcaaac-ac tcagacga" ccaaaacccg gcgccccgeg gagatcct-a gactggccag 43 0 

caaacaccaa aacgagaccc cagacaggaa aacctgcgcc cccgcatcag ccccaaagac 540 

ctgcagcact gagcg-g-co tgcggaaaac agagaggcic cagaaatggc tgcaggccaa SQO 

gcgcctcacg ccggacc-gg tgcaggactg tcaccagggc cagagagaac taaagtccoe 660 

gcgtangctg aganaacacc agtgaaaaag cscggcatgg agcccagcac tgagaac"c 72 0 

cagaaagcgt cagccccccc ccaactgcgc tataccaacc acattctcaa atagcaa-ca 73Q 

CCaaagaggc taccgcatca aacccccaca cgcagccccc a:gccacctc cagaat^cac 340 

caacacacag gcccaccagc aacaggcacc tctgcacaat accent cgat gacaatccaa 900 

agccccggcc cccccccacc acaccg-ggc cccccagacg gggctgccgc cgagcccacc 560 

ccaaccccag acg-gatccc ccctgcgacc Cacizccctigg caagat cccn ccag-cccgg 1020 

acaggee- cc ccnacgagac agaacccgat aaggagctag ggcaatcctg acaaca"ac 1030 
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caaaggccca ca»aact:-c;: aaac^ccgg- ccggtccgaa ggaaaacccg cccc~gccct 1140 

agtgatggan gaacocoooo aticcocggct Cctagaggga aaaaaaagca taccocoon- 12 0 0 

act-ctcaag cacccccacc agagtcacga aaccacc-gt caagacta?:c tacctrt^cac 12SQ 

gtttccatrc cgg-caagaac tcxtcaaacg aggacactgc tgactgecgg tgacgcti- en 13 20 

tgagcaaaca cccgggggca tggatgaaag ccaaccgcag gucaaacgac ccczcgggga 12 30 

agctacctcz cctccattca gacctcacta aaatcttcca agatgaaagc aaaaaaaaaa 1440 

aaaaaaaaaa aaaaa 1455 



<21Q> 59 

<211>- 593 

<212> ENA 

<213> Homo sapiens 



<4QQ> S9 

ccacgcttcc ggagccc~ca cttccccagc aacgcccttc agtacacaac aagcctaact SO 

gataaacaga a^atccagaa aggcgagact Cgggcttacc accgggcdta aaccacaggg 120 

acctagggcg agggoocagg gc-cctctgg agcagacacc gucaagttca tggcc-cagg 130 

Cagcatgtiat coggccccaa ccc^gaicgc agcaaaagrn c-gagaggag c^gagccccg 2 40 

ttg-ggccca ccaaagaaca gggccctcag gcccigcccg c-tcccgncc accgccccct 3 00 

ccccaccccc agcccagccg agggaatccc gtiggg-tigc- tacczaccta caagg-gg~c 3 SO 

cacaagc-gc tgtcctggcc accgcactca aat:ccaacg cgt:actccac ag^gtaaaaa 420 

tccataccac tgtgaggctt tcegcct.ucc ttccttcttn tccccttccg g-ataiicgct 430 

gcatccactc taacocccag aaacaaacgo tacataggaa ccgacaaaaa aaaaaaaaaa 540 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 593 



<210> SO 

<211> 49S 

<212> DNA 

<213> Homo sapiens 



<4Q0> SO 

aattcggct; tcgagcggcc gcccgggcag gccccgcoot ttgcttataa ttgacaacac SO 

gtgcaaaaac accaaacccg Cgtcctgtgc ag-acgaaga atecagtgaa taitcatcaa 12Q 

tgtattagcc tgccctgccc tctg~ccaca tacggctcza ttcttagaaa cacaactcga 13Q 

acgtgatcct tcaacagtct gaatactcca caaa-tatag ctacgtctcg tgaaaataac 240 

ctcaaaaaga aaaacacgac cccgtcgncc eacocgacac ttcucgccc- agtaatgtac 3QQ 

tcgacactca tgotccoaag cagtgoaagt accagtagaa tocctctgoc aaactcaatg 3 SO 

accacttagt acttt^gcct tctoccatgn gctttgaagga aaaacaaagc gccaccaccg 420 

tatcccctgt tttcaccaaa aaataaaaaC aacccaaaaa aaaaaaaaaa aaaaaaaaaa 430 

aarrgggsgg cccccc 495 



<21Q> SI 

<21i> 1292 

<212> DNA 

<213> Homo sapiaris 



<22Q> 

<221> SITE 
<222> (71) 

<223> c. equals a,t,g, or c 



<220> 

<221> SITE 
<222> (S97) 

<223> a equals a,t,g, or c 



44 



<22Q> 

<221> SITE 
<222> (1230) 

<223> c equals a,t,g, or c 
<22G> 

<221> SITE 
<222> (1237) 

<223> ii equals a,t,g, or c 
<220> 

<221> S~TE 
<222> (1291) 

<22 3> n equals a,C,g, or c 
<4QQ> SI 

aaacctctic tacaggtaaa - gc-ggiiacgc c-gcaggtac cggtccggaa tCcccgggcc 60 
gacccacgcg nccggaaaga ggaaacacag aggcgccaaa ggaacaaaga cacaatgacg 120 
ccacccaagc caacaagcca tgccgaagca aacgaaacca "acccaaccc t^acccacca 130 
agcagcttca tggcccccgg acctcaacag cctctgggtc caaccaacct: agaaaaccaa 240 
gctcaggg-g ctcagcgtgc tcagccctac ggcaccacat ciccgggaat cttcgctagc 3 00 

agtcaaccgg gtcaaggaaa tacacaaacg acaaatccaa g-gngggaac agcagtaacg 3 60 

aactc-aaag aagaagcaaa ggcac~aggg gcgacccaga ccatggccgg accgacgcac 420 
aCCggttccg gaatcgtcct gcg;t;aata scctcccctt cnagagaagt attaggtccc 430 
gcctctaccg ctgkcactgg tggacaccca ccctggggtg gcctcccttc caccaccccc 540 
ggccccctct ctgcgtcagc acccaaggag ctutcccgt" gtccggtgaa aggcagcccg 600 
ggaatgaaca ccggcaggtc caccctggcc ttcaccggag tgaccctgcc gc~ggtggac 650 
atgtgcatca acggggtaxc tggccaagac tactggnccg cgcttcctgg aaaaggcacc 720 
ccagccacgc cgacgatctc ccccytctcg gagccccucg cagcccgtgc cacagcccac 730 
titgccaacc aagcaaacac cacaaccaat atgtctgccc tgg-cactcc aaacacg-ac S40 
gaaagcaacc ccgkgacacc agcgtcttce tcagctcccc ccagacgcaa caactactica 900 
gctaacgccc ctaaaagaaa aaggggcatc agtctaaccc catggagaaa aaccacicgc 960 
aaaaactcet "aagaagatg ccct'cactg tctacaaega tc-ccagtct tcaaaaaccg 1020 
tgttcgagac tcgttcccag gccggccgct aaigacggct: gcacctcccc ccactgcccc 1030 
ttcccacatc accaccacta cacgctggca aaggtgaagg a;cagaggac cgaaaaacga 1140 
ttctgcaacc cccccaaagt; tagaaatgtt tctg-ccata ccactcttcc cttaataaaa 1200 
tg-cactaga aacaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaagggc ggccgctcca 1250 
gaggacccaa gctcacgtan gcgt.gcn.cgc ca 1292 

<210> 62 

<211> 393 

<212> DMA 

<213> Homo sapiens 



<40Q> 52 

cccccccggc ccccccatta aC.gccat.ggt ttgggaggag cckgaccccc cgaaccccat 

acsiaacctc cac~gcccras ggaaacgccc tacaccactt: ccccctaact ggaagcataa 120 

tcagagcgac gttggcaggg cagaaaaaga gggagccacc cgatgctccc aggctaacca ISO 

gagctacggg tgccacaggc tcgtctctct aag-gacaca tcctcaccta attcccagac 240 

caggtc"cga aagntwtggg ggtctccica gacczcaacc cctacctcct ccacggtaca 300 

aacacgtiaca aaagaaaaag gtictcacatt ccct-acaca aacccacaaa caaatctcga 3 50 

accccttctg tctaaaaaaa aaaaaaaaaa aaaaaaaa 393 
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<21Q> 63 

<211> 1202 

<212> DNA 

<213> Homo sapiens 



<22Q> 

<22l> SITE 
<222> (232} 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (596) 

<223> n equals z.Z.q, or c 
<22Q> 

<221> SITE 
<222> (607) 

<223> a equals a,t,g, or c 
<22Q> 

<221> SITS 
<222> (120Q) 

<223> n equals a,t,g, or c 
<400> S3 

gaattcggca cgagactaag tcg-gcac" taattgggtg aatcgtacat gtragtcata SO 
CaCccrac-g tagccgcwac taaaaaacaa caggaggcca tgtgggccgc taggag~agc 120 
aaCg~cHguy cccagccagc aggtagagac cagggctgga cagagicagna tgggccgcgc 130 
tgcagacnat tcg^ggtacc caacCgctgc aCaaaacagg gtgtgatccc ttgcaccgct 240 
atgcacgagt ggaiitcccag taaatcgiigc caggc-gccc gnacgacgzg tggczcgtgc 2QQ 
tttggascgn aatgccCacc cacgceactt aagtcacaca ccctgcggcc tctg-ggcca 3S0 
ggacigcggg ctiactacctg kagngacccg tCaggggaaa ggacccacag cccg-gcagg 420 
aggaaaaaag catctctgag Cacagggcgg atgagccgga tgagctgccg ggcaagagcc 430 
acgcacaccc aggtggtgag cctcaaggat aaggrggaat t-gccccata gccgccccgg . 540 
acagaaaccg cccagagaag aacgaatgga ggaca^aggg c-ccgt-ggtc ccaccncttc 600 
ttggganaoc Cgtgactggz cctgttacca tgccaactca gccccaaacc catctctgat SS0 
tgactcggnc gcttantt-g gcacattctt gc:ccacaca gccacataca tactggctgc 720 
tcctcsaagg ccaggcagat gcagcagccg tcgggccasc aaagargaar gtcctggaar 730 
gttccggcct: gaacgctgca cctgtcg-g- gacagccaca actgctcagg ctcccttgcc 340 
Cgtggg-gca c-g^ggggag gagtg~~a"g ataagaacat cggctctcag tc"ccccgg 900 
ggagaagccc ggccccacgt gggat~r.ggg cgctgccttc aggaaggccc tccgcacgtc 9S0 
tagtzccagc csigtactggg aagaattaaa aaag-ctgcc agctccnctia gcctgicczg 1020 
ccttitgtga cgatcctccc tgagatcccc ccctaicagc tcaggagcgg gactccccgg 1030 
agaaggaaag CgCCCttcc^ gccccccact gcCcacctcg gggcatccag gaacatgggc 1140 
ctgacgaacc tgcccgaagg cactctgtiaa tcccatcacc ttgggagcca aagaggcggn 120 0 

ca " " 1202 



<21Q> 64 

<211> 1517 

<212> DNA 

<213> Homo sapiens 



<40Q> 64 

gatzacgcca actcgaa"c aaccctcacn aaagggaaca aaagctggag ctccaccgcg SO 

gtggcggccg ctctagaacc agtggacccc ccgggccgca ggaactcggc acgagggagc 120 

ccagagccca agatggagcc ccagctgggg cctgaggccg ccgccc^ccg ccccggctgg 130 

ccggccctgc tgccgtgggt: ctcagccctg agcngccccc ccccctcgcc agcc:c;ccc 240 

cxtccctc-c tggcgcccca agccagaacc agctacaatt ttggaaggac tttzcccoggn 300 

cttgacaaat gcaaigcccg caccgggaca Cctacttgca agaag-ccts: taaagaaaga 360 

aacaagaccc gacaaccggc tggc-tccca cctcgggact gccucccgac tccctcgccn 420 

ucttatcccg gcaaattacC caggatgatt ccaaaa~c"g gcgccccgcg gaga-cc ciia 430 
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gactggticag caaacaccaa aacgagacc- cagacaggaa aatc-gtgcc cctgcaccag 54Q 

ccccaaagac ctgcagcact gagcgcgtcc tgcggaaaac agagaggntc cagaaacggc SQQ 

tgcaggccaa gcgccccacg ccggacctgg tgcaggacng ccaccagggc cagagagaac SSQ 

taaagtccct g-gtiatgctg gagataacac cagtgaaaaa gccctggcai ggagccccag 720 

cactgagaac ncccagaaag cgtcagcctt cccccaaccg Cgtcatacca accacactcc 730 

caaacagtaa ccatcaaaga ggccnctgca tcaaacctcc acatgcagcc ccca-gccac 340 

cctccagaat ccaccaacac acaggcccac cagcaacagg cntacctcti; gcacaaca-t: 9Q0 

ttttgacgac aatccaaagc cccqgctczz ccccaccaca ccgtggtccc ctagacgggg 960 

ctg"gctga gcccacccca accccagatg tgacccccct gzgacctiacTi tccgggccaa 1020 

gat^ccccag cccggacagg tcrtccccca tgagacagaa cccgaizaagg agcragggca 1Q8Q 

attctgacaa caecaccaaa ggcccacaca actcctaaac tteggcctgg tccgaaggaa 1140 

aacctgtccc tgccccagtg a;ggazgaac tctcttacc- ctggctccca gagggaaaaa 1200 

aaagcatacc tcttccactc tccaagtacc tccaccagag ccatgaaatc acctgtcaag 1260 

actacctacc ctccacg^tc ccattctggc aagaactcnc caaatgagga cactgctgat: 13 20 

Cgctggrgac gttccccgag caaacacccg gggg-acgga tgaaagccaa tcgcaggtca 13 SO 

aatgactcct tggggaagct actccccctc tatzccagact ccactaaaac ctcccaagat 1440 

gaaagcaaaa aaaaaaaaaa aaaaaaaaaa aci:cgagggg gggcccgcac ccaacccgcc 1SO0 

ccacagcgag tcgcacc , 151/ 



<2ia> 65 

<21i> 52S 

<212> DNA 

<213> Komo sapiens 



<220> 

<221> SITE 
<222> (SS) 

<223> n equals a,t,g, or c 



<220> 
<221> SITH 
<222> (1GS) 

<223> r. equals a, eg, ar c 



<22Q> 

<221> SITE 
<222> (484) 

<223> a equals a,t,g, or c 



<400> 65 

ctctgacagc tCcczcGC-g gccaagccct gcctctg-ac agec-cgag- ggacagccag SO 

aggtcnagac tggagcccag agcccaagac ggagccccag ctgggncctg aggccgccgc 12 0 

cctccgccct ggctggccgg cccCgctgcC gcgggrccca gccc-gagct gttctccctic 130 

cctgccagcc tccccccccc ctcceccggc gccccaag-cc agaaccagcc acaatcc-gg 240 

aaggacttcc ctcggt:cccg acaaaCgcaa Cgcccgcatc gggacatc-a ttzgcaagaa 3QQ 

gttccctaaa gaagaaacaa gacctgacaa ctggctiggc- tcccaccttg ggactgccnc 360 

ccgactccct ttgstucccc acccctgcaa aitactccar atgaccycca aaatctggsg 420 

sccccgtgga raccccccaa ccggccagca awtwtcaaac gaaatctcca aacaggaaac 480 

ctcr.Cgcctc ctgcacccac ccccaaagaa cccigcacatc gaegtc 52S 



<21Q> 66 

<211> S54 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> SITE 
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<222> (31) 

<223> c equals a,t,g, or c 
<22Q> 

<221> SITE 
<222> (€3) 

<223> n equals a,t,g, or c 



<4QQ> 65 

caggctctca atacggactc actcataggg naaagctggt acgcctgcag gtaccggtcc £0 

ggnaatcccc gggtcgaccc acgcgccgcr gagctctcac tcctccagca acrcrct'ca 120 

gtacataata agcttaactg ataaacagaa tatttagaaa ggcgagactt gggcttacca 130 

ttgggtztaa accacaggga ccnagggcga gggttcaggg cttccctgga gcagatactg 2 40 

tcaagztcac ggcctcaggt agcacgtatc tggtcttaac tctgattgta gcaaaagttc 3 00 

tgagaggagc cgagccctgt tgtggcccat taaagaacag ggccctcagg ccccgcccgc 3 SO 

ttcctgtcca ctgccccccc cccaccccca gcccagccga gggaatcccg tgggttgctt 420 

acctacctat aaggtggttc ataagccgct gtcctggcca ctgcattcaa attccaatgt 430 

gtacttcaca gtgcaaaaat ttacafctatt gcgaggtttc ttgccttttt cttttccttt 540 

ttttctggta tattgctgta tccactttaa cttccagaaa taaacgttac atrggaaaaa 600 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 6 60 

aaaa ' - •- 664 



<210> 67 

<211> 156 

<.212> PRT 

<213> Homa sapiens 

<220> 

<221> SITE 
<222> (156) 

<223> Xaa equals scop translation 
<4Q0> 67 

Met Arg Leu Trp Lvs Ala Val Val Val Thr Leu Ala Phe Met Ser Val 
1 *5 10 15 

Asp lie Cys Val Thr Thr Ala He Tyr Val Phe Ser His Leu Asp Arg 
20 25 3Q 

Ser Leu Leu Glu Asp He Arg His Phe Asn He Phe Asp Ser Val Lau 
35 40 45 

Asp Leu Trp Ala Ala Cys Leu Tyr Arg Ser Cys Leu Leu Leu Gly Ser 
50 " 55 SO 

His His Trc Cys Gly Gin Glu Gin Cys Ala Gly Ala Pro Ala Ala Ala 
65 " " 70 75 SO 

Gly Leu Val Ala Gly His His Pro Arg Val Pro Leu Arg Gly His Leu 
S5 90 95 

Cys His Gly Glu Ala Ala Ala Leu Leu Arg Gly Ala Gin Ala His Pro 
io a 10 5 no 

Gly Pro Leu Val Lau Gly Pro Val Arg Val Asp Val His Phe Thr Arg 
115 120 125 



Arg Tie Leu Pro Ala Lau Val Ala Ala Val 



His Arg Ala Ala Arg His 



43 



13 0 



13 5 



140 



Pro Gly Pro Gly Ala Arg Gly Gly His Arg Gly Xaa 
145 ' 150 155 



<210> S3 

<211> 70 

<212> PUT 

<213> Homo sapiens 

<220> 

<221> SITS 
<222> (70) 

<223> Zaa equals scop translation 
<400> sa 

Met Ala Ala Arg His Lau Pro Gly Phe His Thr Tyr Thr Asa Leu Leu 
1 " 5 10 15 

Phe Leu Lau Lau Pro Ser Leu Leu Met Gly Tyr Ser Glu Sar Pro Pro " 
20 25 20 

Pro lia Thr hsa Ser Trp Ala Fro Phe Ila Ser Lau Thr His His Vai 
35 " 4Q 45 

Leu Sar Gin Ser Gin Sar Pro Leu Ser Ser Asn Cys Trp He Cys Lau 

' 50 55 SO 

Ser Thr His Thr Gin Xaa 
65 70 



<210> 69 

<21i> 502 

<212> PRT 

<213> Home sapiens 

<220> 

<221> SITS 
<222> (502) 

<223> Xaa equals stop translacion 

<400> 69 

Met Trp Lvs Leu Tm Arg Ala Glu Glu Gly Ala Ala Ala Leu Gly Gly 
1 " 5 10 15 

Ala Leu Phe Lau Leu Lau Phe Ala Leu Gly Val Arg Gin Leu Leu Lys 
20 25 30 

Gin Arg Arg Pro Met Gly Phe Pro Pro Gly Pro Pro Gly Lau Pro Phe 
35 40 45 

lie Gly Asn He Tyr Ser Leu Ala A.ia Ser Ser Glu Leu Pro His Val 

5.0 5 5 60 



Tvr Met Arg Lys Gin Ser Gin Vai Tyr Gly Glu Ila Phe Ser Leu Asp 

S3 70 75 8 0 
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Leu Gly Gly lie 



Glu Cys Lau Val 
1QG 

Pro Leu Phe Me~ 
115 

Tyr Gly Arg Gly 
13 0 

Arg Tyr She Gly 
145 

Glu Thr Lvs She 



Fhe -Asp Phe Lys 

iao 

Leu lie He Phe 

195 

His MeC lie Glu 
210 

Ser Val Phe Leu 
225 

Gly Lys His Gin 



Leu Ser Arg Lau 

2S0 

Pro Gin His Phe 
275 

Asn Asp Pro Ser 
290 

Gly Glu Leu lie 

3 05 . 

Trp Ala He Leu 



Gin Lys Glu lie 
340 

Asp Asp Lys Cys 
3 55 

Leu Arg Phe Cys 

370 

Glu Asp Ala Val 
385 



Ser Thr Val Val 

as 

His Gin Ser Glu 



Lvs Wet Thr Lys 
120 

Trp Val Asp His 
135 

Tyr Gly Gin Lys 
ISO 

Phe Asn Asp Ala 
1S5 

Gin Lau lie Thr 



Gly Glu Arg Phe 
200 

Leu Phe Ser Glu 
215 

Tyr Asn Ala Phe 
23 Q 

Gin Leu Phe Arg 
245 

lie Glu Lys Ala 



Val Asn Ala Tyr 
280 

Ser Thr Phe Ser 
295 

Ha Ala Gly Thr 
310 

Pha Met Ala Lau 
325 

Asp Lau lie Mec 



Lys Met Pro Tyr 
360 



Asn He Val Pro 
375 

Val Arg Gly Tyr 
39Q 



Leu Asn Gly Tyr 
90 

He Phe Ala Asp 
10 5 

Met Gly Gly Leu 



Arg Arg Lau Ala 

140 

Ser Phe Glu Ser 
15 5 

He Gin Thr Tyr 
170 

Asn Ala Val Ser 
185 

Thr Tyr Glu Asp 



Asn Val Glu Leu 
220 

Pro Trp He Gly 
235 

Asn Ala Ala Val 
250 

Ser Val Asn Arg 
2SS 

Leu Asp Glu Met 



Lys Glu Asn Leu 

3 00 

Glu Thr Thr Thr 
315 

Tvr Pro Asn Ha 
330 

Gly Pro Asn Gly 
345 

Thr Glu Ala Val 



leu Gly lie Phe 
380 

Ser He Fro Lys 
395 



Asp Val Val Lys 
95 

Arg Pro Cys Lau 
110 

Leu Asn Ser Arg 
125 

Val Asn Ser Phe 



Lvs He Leu Glu 
ISO 

Lys Gly Arg Fro 
175 

Asn He Thr Asn 

190 

Thr Asp Phe Gin 
205 

Ala Ala Ser Ala 



He Leu Pro Phe 
240 

Val Tyr Asp Phe 
25 5 

Lys Pro Gin Leu 

27Q 

Asp Gin Gly Lys 
285 

He Phe Ser Val 



Asn Val Lau Arg 
320 

Gin Gly Gin Val 
335 

Lys Fro Ser Trp 
3S0 

Leu His Glu Val 
3 55 

His Ala Thr Ser 



Gly Thr Thr Val 

400 



50 



Ila Thr Asn Lau Tyr Ser Val His Phe Asp Glu Lys Tyr Trp Arg Asp 
405 410 415 

Pro Glu Val Phe Kis Pro Glu Arg Phe Lau Asp Ser Ser Gly Tyr ?he 
420 425 430 

Ala Lvs Lys Glu Ala Leu Val Pro Phe Ser Leu Gly Arg Arg Kis Cys 
435 440 445 

Lau Glv Glu His Leu Ala Arg Men Glu Mec Phe Leu Phe Phe Thr Ala 

450 455 450 

Leu Leu Gin Arg Phe His Leu His Phe Pro His Glu Lau Val Pro Asp 
465 " 470 475 430 

Leu Lys F-o Arg Leu Glv Mec Thr Leu Gin Pro Gin Pro Tyr Lau He 
435 490 495 

Cys Ala Glu Arg Arg Xaa 
SQ0 



<21Q> 70 

<21L> 189 

<212> PRT 

<213> Homo sapiens 

<22Q> 

<22L> SITE 
<222> (35) 

<223> Xaa equals any of Che naturally occurring L-axiu.no acids 
<22G> 

<221> SITE 
<222> (104) 

<222> Xaa equals any of Che naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (164) 

<223> Xaa equals any of she nacuraiiy occurring L-ami.no acids 
<22Q> 

<22i> SITE 
<222> (189) 

<223> Xaa equals stop translation 
<4QQ> 70 

irfgi- ir-c d-o ij.a pn e Ala Leu Cys Leu Leu Trp Gin Ala Leu Trp Pro 

" L ; ' s lq is 

Glv oro Gly Glv Glv Glu His Pro Thr Ala Asp Arg Ala Gly Cys Ser 

20 25 30 

Ala Ser Gly Ala Cys Tyr Ser Leu Kis His Ala Thr Met Lys Arg Gin 
35 ' 40 45 

Ala Ala Glu Glu Ala Cys He Lau Arg Gly Gly Ala Lau Ser Thr Val 



51 



50 

Arg Ala Gly Ala Glu 

' 65 

Pro Gly Pro Gly Xaa 
35 

Glu Arg Arg Arg Ser 

1QQ 

Phe 3ar Trp Leu Ser 
115 

Gin Trp Val Glu Glu 

13 0 

Lau Pro Gly His Arg 
145 

Cys His leu. Saa Ala 
165 

Cys Val Leu Arg Arg 
180 



55 

Lau Arg Ala Val Leu Ala 

70 75 

Gly Ser Lys Asp Leu Leu 
90 

His Cys Xaa Leu Glu Asn 
105 

Ser Asp Pro Gly Gly Leu 
120 

Pro Gin Arg Ser Cys Thr 

13 5 

Trp Gly Arg Ala Arg Ser 
150 155 

Asn Ala Thr Cys Ala Ser 
170 

Ala Pra Gly Pra Pro Leu 
135 



60 

Leu Leu Arg Ala Gly 

30 

Phe Trp Val Ala Lau 
95 

Glu Pro Lau Arg Gly 
110 

Glu Ser Asp Thr Leu 
125 

Ala Arg Arg Trp Val 

140 

Trp Lvs Glu Met Arg 
160 

Thr Ser Lau Arg Ser 
175 

Thr Zaa 



<210> 71 
<211> 436 
<212> PB.T • 
<213> Homo sapi 

<400> 71 
Met Gin Pro Sar 
1 

Leu Lau Sar Lau 
20 

Ala Tyr Thr Thr 
3S 

Pro Arg Ala His 
50 

Ser Sar Pro Ser 
65 

Arg Asp Phe Pro 



Arg Gin Val Tyr 
10Q 

Sar Tyr Gly Gin 
115 

Ala Gin Phe Trp 
13 0 



Gly Lau Glu Gly 
5 

Leu Leu Leu Lau 



Pro Gly Pro Pro 
40 

Thr Met Pro Gly 
5 5 

Thr Gin Gly Lau 

70 

Leu Val Asp Gly 
S5 

Gin Lys Gly Lau 



Thr Sar Leu Asp 

120 

Ser Ala Tyr Val 
13 S 



Pro Gly Thr Phe 
10 

Leu Leu Leu Gin 
25 

Arg Ala Lau Thr 



Thr Tyr Ala Pro 
60 

Gin Glu Gin Ala 
75 

His Asn Asp Leu 
90 

Gin Asp Val Asn 
105 

Arg Lau Arg Asp 



Pro Cys Gin Thr 
140 



Gly Arg Trp Pro 
15 

Pro Val Thr Cys 
30 

Thr Leu Gly Ala 
45 

Ser Thr Thr Lau 



Arg Ala Leu Met 

30 

Pro Leu Val Leu 
95 

Lau Arg Asn Phe 
110 

Gly Lau Val Gly 
125 

Gin Asp Arg Asp 



52 



Ala Leu Arg Leu Thr Leu Glu Gin lie Asp Leu lie Arc Arg Met Cys 
145 15Q 135 160 

Ala Ser Tyr Ser Glu Leu Glu Lau Val Thr Ser Ala Lys Ala Leu Asn 
155 170 175 

Asp Thr Gin Lvs Leu Ala Cys Leu lie Gly Val Glu Gly Giy His Ser 

ISO iaS 130 

Lau Asp Asn Ser Lau Sar He Lau Arg Thr Phe 'Tyr Mat Lau Giy Val 
195 200 205 

Arg Tyr Lau Thr Lau Thr His Thr Cys Asn Thr Pro Trp Ala Glu Ser 
210 215 220 

Ser Ala Lys Gly Val His Ser Phe Tyr Asn Asn lie Ser Giy Lau Thr 
225 * ' 230 235 240 

Asp Phe Gly Glu Lys Val Val Ala Glu Ken Asn Arg Leu Gly Met Met 

2 45 2S0 25 5 

Val Asp Lau Ser His Val Ser Asp Ala Val Ala Arg Arg Ala Leu Glu 
2S0 255 270 

Val Ser Gin Ala Pro Val He Phe Ser His Ser Ala Ala Arg Gly Val 
27S 230 235 

Cys Asn Ser Ala Arg Asn Val Pro Asp Asp lie Lau Gin Lau Leu Lys 
220 235 300 

Lys Asn Giy Giv Val Val Mac Val Ser Lau Ser Met Gly Val lie Gin 
305 ' " 3 LQ 315 320 

Cys Asn Pro Ser Ala Asn Val Ser Thr Vai Ala Asp His Phe Asp His 

3 25 3 30 335 

lie Lys Ala Val lie Gly Ser Lys Phe lie Giy lie Gly Gly Asp Tyr 
340 " 345 350 

Asp Gly Ala Gly Lys Phe Pro Gin Gly Leu Glu Asp Vai Ser Thr Tyr 
355 3 SO 365 

Pro Val Lau lie Glu Glu Leu Leu Ser Arg Gly Trp Ser Glu Glu Glu 
370 375 380 

Leu G'a Gly Val Lau Arg Giv Asn Lau Leu Arg Vai Phe Arg Gin Val 
335 " 330 335 400 

Glu Lvs Vai Gin Glu Glu Asn Lys Trp Gin Ser Pro Lau Glu Asp Lys 
405 410 415 

Phe Pro Asn Glu Gin Leu Ser Ser Ser Cys His Ser Asp Leu Ser Arg 
"420 425 43 0 

Leu Arc- Gin Arg Gin Sar Lau Thr Ser Gly Gin Giu Lau Thr Giu Ha 
435 ' 440 445 

Pro He His Tro Thr Ala Lys Lau Pro Ala Lys Trp Ser Val Ser Glu 
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450 455 460 

Ser Ser Pro Sis Kec Ala Pro Val Leu Ala Val Val Ala Thr Phe Pro 
465 470 475 430 

Val Leu lie Leu Trp Leu 
4S5 



<21Q> 72 

<2ii> aa 

<212> FRT 

<213> Home sapiens 

<220> 
<221> SITE 
<222> (38) 

<223> Xaa equals stop translation 
Mr <4Q0> 72 

D Met Vai Ala Ser Glv Crrs Leu Leu Leu Ala Gin Ala Ser Phe Leu Pro 

glS 10 IS 

~~ Leu Ala Pro Pro Gly Ala Leu Gly Ala Giy Cys Trp Met Asp Gly Arg 

+ 20 ' 25 30 

i|j 

83 Pro Leu Ala Pro Pro Gly Ala Lau Gly Ala Gly Cys Trp Met Gly Gly 
St 35 40 45 

rj Arg Pro Lau Ala Pro Pro Giy Ala Lau Gly Ala Giy Cys Trp Mec Gly 

tt ' 50 55 SO 

^ Gly Arg His Giy Ala Pro Leu Leu Giy Cys Lau Cys Pro Sar Gly Lau 

yj S3 " * 70 75 30 

fU Cys Ser Ser Tyr Val Cys Lau Xaa 

35 



<210> 73 

<211> 299 

<212> PPT 

<213> Homo sapiens 

<22Q> 

<221> SITE 
<222> (167) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<40Q> 73 

Met Met Ser Sar Lys Pro Thr Sar His Ala Glu Val Asn Giu Thr lie 
1 5 10 15 

Pro Asn Pro Tyr Pro Pro Ser Sar Phe Met Ala Pro Giy ?he Gin Gin 
20 25 30 

Pro Lau Gly Ser lie Asn Leu Glu Asn Gin Ala Gin Gly Ala Gin Arg 
3 5 40 45 



54 



Ala Gin. Pro T-rr Glv lie Thr Ser Pro Gly Us Phe Ala Ser Ser Gin 
SO * " 55 SO 

P^-o Glv Gin Glv Asn Ila Gin Wet lis Asn Pro Sar Val Glv Thr Ala 
65 " 70 75 80 

Va^ Met AST. Phe Lvs Glu Glu Ala Lys Ala Leu Gly Val lie Gin lis 

35 30 9 5 

Kac Val Glv Leu Wee His Ila Gly Phe Gly lie Val- Leu Cys Leu lie 
10 Q 10 5 110 

Ser Phe Ser Phe Arg Glu Val Leu Gly Phe Ala Ser Thr Ala Val He 
US " 120 12 5 

Gly Glv Tyr Pro Phe Trp Gly Gly leu Ser Phe He lie Ser Gly Ser 
13 0 " 13 5 140 

Leu Ser Val Ser Ala Ser Lvs Glu Leu Sar Arg Cys Leu Val Lys Gly 
145 ISO 155 150 



Ser Leu 



Gly Mec Asr. lie Xaa Ser Ser lie leu Ala Phe He Gly Val 



lbs 



70 175 



Tie Leu leu Leu Val Asa Met Cys lis Asr. Gly Val Ala Gly Gin Asp 
130 13S 190 

Tyr Tro Ala Val Leu Ser Gly Lys Gly lie Ser Ala Thr Leu Met lie 
195 200 205 

Phe Ser Leu Leu Glu Phe Phe Val Ala Cys Ala Thr Ala His Phe Ala 
2 1Q 215 220 

Asn Gin Ma Asn Thr Thr Thr Asn Met Ser Val Leu Val Ila Pro Asn 
225 220 235 240 

Met Ty~ G 1 " Se^- Asn Pro Val Thr Pro Ala Ser Ser Sar Ala Pro Pro 

245 250 255 

Arg Cys Asn Asn Tyr Ser Ala Asn Ala Pro Lys Arg Lys Arg Gly lie 
. " 2S0 2S5 270 

Ser Leu Ila Ser Tra Arg Lys Thr Thr Cys Lys Asn Phe Leu Arg Arg 
275 230 255 

Cys Leu Leu Leu Ser Thr Met Ila Ser Ser Leu 
290 295 



<21Q> 74 

<2.il> 43 

<212> PR.T 

<213> Homo sapisns 
<22Q> 

<22i> SITE 

<222> (43) 

<223> Xaa equals stop translation 
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<4GQ> 7 4 

Met Ala Lau Kis Pro Glv Ser Ser E:.s Leu Leu Val Ala Val Pro "val 

1 5 10 IS 

Ser Trp Phe Leu Pha Cys lie Pro Gly lie Ser Phe lie Thr Lau Ser 

20 25 30 

Trp Ser Tyr Gin Glu Ser Pro Val Ser Pile Leu Ser Val Glu Gly Xaa 

'35 40 45 



<210> 75 
<211> 44 
<212> P5.T 

<213> Homo sapiens 
<220> 

<221> SITS 
<222> (44) 

<223> Xaa aquals scop cranslatian 
<400> 75 

Met Tvr Ser Leu Phe Leu Thr Cys He Fhe Pro Phe Thr Leu Cys Kis 
I ' 5 10 15 

Lys Lys lie Leu Met Val Tie Kis Asp Phe Thr Gly Pro Val His Val 
"20 2 5 3 0 

i=he °ro Glu Lvs Thr Val Leu Glu Trp Asa Tyr Xaa 
35 ' 40 



<2iQ> 7S 

<211> 140 

<212> PB.T 

<213> Home sapiens 

<400> 76 

Met Cvs Ala Met Tyr Leu MeC lie Lys Ala Phe Leu Pro Lys Mac Leu 
1*5 10 IS 

Ala Gin Lys Ser Glv Asn lie Tie Asn -Met Ser Ser Val Ala Ser Ser 
20 " 25 30 

Val Lys Glv Val Val Asn Arg Cys Val Tyr Ser Thr Thr Lys Ala Ala 
35 40 45 

Val lie Gly Leu Thr Lys Ser Val Ala Ala Aj=p Phe lie Gin Gin Gly 
50 55 SO 

Va Arg Cvs Asn Cys Val Cys Pro Gly Thr Val Asp Thr Pro Ser Lau 
■65 " " 70 75 30 

Gin G<u Arg Tie Gin Ala Arg Gly Asn Pro Glu Glu Ala Arg Asn Asp 
35 90 9S 
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Phe Leu Lys Arg Gin Lys Thr Gly 
ICQ 

Ala Met Leu Cys Val Tyr leu Ala 
115 120 

Gly Asn Pro Val lie lie Asp Gly 

UQ 135 



Arg Phe Ala Thr Ala Glu Glu lie 
105 110 

Ser Asp Glu Ser Ala Tyr Val Thr 
125 

Gly Trp Ser Leu 

140 



<21Q> 77 

<211> 153 

<212> PRT 

<213> Homo sapiens 

<22Q> 

<221> SITE 
<222> (1S3) 

<223> Xaa equals stop, translation 
<400> 77 

Met Leu Val Val Cys Leu Leu Leu Ala Thr Gly Phe Cys Leu Phe Arg 
1 "s 10 15 

Gly Leu lie Ala Lau Asp Cys Pro Ser Glu Leu Cys Arg Leu Tyr Thr 

20 25 30 

Gin Phe Gin Glu Pro Tyr Lau Lys Asp Pro Ala Ala Tyr Pro Lys lie 
35 40 43 

Gin Me- Leu Ala Tyr Kec Phe Tyr Ser Val Pro Tyr Phe Val Thr Ala 
30 * 55 SO 

Leu Tyr Gly Leu Val Val Pro Gly Cys Ser Trp Met Pro Asp lie Thr 
65 * 70 75 30 

Leu lis His Ala Gly Gly Leu Ala Gin Ala Gin Phe Ser His lie Gly 
35 50 95 

Ala Ser Leu His Ala Arg Thr Ala Tyr Val Tyr Arg Val Pro Glu Glu 
1Q0 105 110 

Ala Lys lie Leu Phe Leu Ala Leu Asn lie Ala Tyr Gly Val Leu Pro 
115 120 125 

Gin Leu Leu Ala Tyr Arg Cys lie Tyr Lys Pro Glu Phe Phe He Lys 
13 0 " 13 5 140 

Thr Lys Ala Glu Giu Lys Val Glu Xaa 
145 ' 150 



<210> 7a 

<21i> 130 

<212> PRT 

<213> Homo sapiens 



<22Q> 

<22I> SITE 
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<222> (43} 

<223> Xaa ecruals any of Che naturally occurring L-amino acids 
<22Q> 

<22L> SITS 
<222> (130) 

<223> Xaa equals stop translation 

<4QQ> 73 

Met Ala Ala Ala Ser Ala Gly Ala Thr Arg Leu Lau Leu Leu Leu leu 
15 10 IS 

Met Ala Val Ala Ala Pro Ser Arg Ala Arg Gly Ser Gly Cys Arg Ala 
20 25 30 

Glv Thr Gly Ala Arg Gly Ala Gly Ala GLu Gly Arg Glu Gly Glu Xaa 
25 ' 40 45 

Pro Val Ser Ser Ala lie Pro Arg Arg Val Cys Trp Ser Leu Leu Ser 

U 50 55 £G 

J2 Pt - 0 P - Q T hr Arg Pro Pro Gly Pro Ala Pro Cys Pro Leu Pro Ser 

m SS 70 75 30 

+" Ala Gly Arg Glv Ala Ala Gly Leu Gly Pro Leu Ala Gin Gin Pre Val 
tfl " " 35 90 55 

HI 

H Ser Pro Ala Pro Ala Ser Pro Met Ala Pro Cys Ser Pro Arg Gly Phe 
LOO 105 HQ 

r 1 Pro Pro Ala His Gly Val GLu Pro Glu lie Lau Ala Thr Mec Pro Val 
if: " 115 " 120 125 

U1 Leu Thr Ser His Pro Pro Thr Pro Ser Pro Cys Ser Leu Gly Thr Cys 
O 130 135 140 

Arg Leu Lau Ser Ser Leu Cys Ala Phe Val Pro Gly Gly Lau Thr Leu 
145 ISO 155 160 

Leu Ser Leu Ala Glv Leu Gly Gly Pro Val Gin Ala Pro Ala Ala Pro 
165 170 175 

Pro Ser Leu Xaa 

130 



<210> 79 
<211> 70 
<212> PKT 

<213> Home sapiens 
<22G> 

<22i> SITE 
<222> <7G) 

<223> Xaa equals snop translation 
<40Q> 79 

Met Leu Kec Glv Ser lie Leu Tyr Val Leu Phe Cys Val Trp Leu Leu 
1 " 5 10 •. 15 
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Gin Cys He Phe Glu He Tyr Pro His Cys Cys Vai Tyr Fro Lys Cys 

20 25 " 30 

Val Leu Pile His Cys Gin lie He: Phe Cys Tyr Wee Asn lie Leu Gin 

35 40 ' 45 

Asn Ila Cys Leu Phe He Tyr Trp Trp lie Phe Ala Phe Val Pro Val 

50 55 60 

Trp Gly Tyr Tyr Glu Xaa 



65 


70 




<2ia> 


ao 




<211> 


191 




<212> 


PRT 




<213> 


Homo sapiens 




<220> 






<221> 


SITE 




<222> 


(191) 




<223> 


Xaa equals scop trans 


lation 


<4QQ> 


30 




Met Arg Ala Cys Pro Trp Ala 


Gin Val 


1 


S 





10 15 

Gly His Leu Ala Val Ser Gin Ala Gly Val Met Ala Gly Val Sar Gly 
20 25 30 

Gly Arg Gly Gly Arg Arg Leu Arg Gly Pro lie Thr Ser Arg Val He 
35 40 45 

Thr Ser Cys Gin Gin Pro Gly Val Gly Val Trp Val Sar Leu Arg Pro 
50 55 60 

Glu Leu Leu Asn Leu Glu Ser Leu Gly Val Ala Ala Lys Gly Val Tyr 
65 70 75 SO 

Asp Lys His Vai Ser Leu Asp He Ser Gly Glu Arg Ser Gly Ala Leu 
S5 90 95 

Val Thr Phe Ser Lys Gly Cys Trp Ala Ser Glu Gin Ser Pro Pro Met 

1QQ 10 5 110 

Ser Gin Pro Leu Gin Gly Pro Ser Leu Ser Leu His Pro Arg Pro Ser 
115 120 125 

Ala Ala Leu Val Met Ser Arg Arg Lys Val Leu Gly Cys Ala Gin Ser 
130 135 140 

Gin Glu Ser Lys He Cys Gin Ala Lys Ala Pro Gly Lys Ser Arg Arg 
145 150 155 ISO 

Ser Leu Gly Trp Pro Pro Gly Cys Gly Ala Ala Arg Ala Lys Thr Vai 
1S5 170 " 175 



Asn Thr Ala Leu Gin Leu Ser- Glu Pro Gin Phe Ser Asn Leu Xaa 
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130 


135 


4, Jm 'J 


P i 




<211> 


16 S 




<212> 


PRT 




<213> 


Homo sapiens 










<22L> 


SITS 




<222> 


(127) 




<223> 


Xaa equals any o 


f the naturally 


<220> 






<221> 


SITE 




<222> 


(136) 




<223> 


Xaa equals stop 


translation. 


<4QQ> 


31 





190 



Met Cys Leu Ser Leu Leu Ala Ala Leu Ala Cys Ser Ala Gly Asp Thr . 
1 " 5 10 IS 

Trp Ala Ser Glu Val Gly Pro Val Leu Ser Lys Ser Ser Pro Arg Leu 
20 25 30 

lie Thr Thr Trp Glu Lys Val Pro Val Gly Thr Asn Gly Gly Val Thr 
35 40 45 

Val Val Gly Leu Val Ser Ser Leu Leu Gly Gly Thr ?he Val Gly lie 
SO " 55 SO 

Ala Tyr Phe Leu Thr Gin Leu lie Phe Val Asn Asp Leu Asp lie Ser 
S5 " 70 75 SO 

Ala Pro Gin Trp Pro He lie Ala Phe Gly Gly Leu Ala Gly Leu Leu 
35 90 95 

Gly Ser He Val Asp Ser Tyr Leu Gly Ala Thr Met Gin Tyr Thr Gly 
10 Q 105 110 

Leu Asp Glu Ser Thr Gly Met Val Val Asn Ser Pro Thr Asn Xaa Ala 
115 120 125 

Arg His lie Ala Gly Lys Pro He Leu Asp Asn Asn Ala Val Asn Leu 
130 " 135 140 

Phe Ser Ser Val Leu He Ala Leu Leu Leu Pro Thr Ala Ala Trp Gly 
145 ISO 155 ISO 

Phe Trp Pro Arg Gly Xaa 
1S5 



<21Q> 32 

<211> 42 

<212> PRT 

<213> Homo sapiens 



<220> 



60 



<22I> SITS 
<222> (42) 

<223> Xaa equals stop translation 
<4QQ> 82 

Met Cys Gly Lau Val lie Lau Tro Pro Cys lia Met Thr Leu Phe Ser 
I " 5 10 IS 

Ser Lau Ser Thr Gly Asp Val Lau Leu Pro Cys Lys Ila Lau Val Gly 
20 " 23 30 

Lau Arc: Val Phe He Gly Ala Arg Val Xaa 
35 40 



<21Q> 33 

<211> 49 

<2I2> PRT 

<213> Homo sapiens 

<22Q> 

<221> SITE 
<222> (43) 

<223> Xaa equals stop translation 
<4Q0> 83 

Met Cys Phe Pro Ala Cys Lau Cys Ser Pra Leu Thr Cys Lau Lau Sar 
1 5 10 15 

Val Trp Lvs Pro Glv Lau Ala His Ala Val Val Kis Cys Met Leu Glu 
20 ' 25 30 

Pro vail Glu Phe Ala Arg Val Val Gin Tyr Glu Ala Gly Kis Val Lau 
35 40 45 

Xaa 



<21Q> 84 

<211> 57 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITS 
<222> (57) 

<223> Xaa aquals stop translation 
<400> 84 

Met Lau He Ala Lvs Lau Pro Val Lau Glu Sar Ila Cys Phe Phe Met 
I " 5 10 15 

Leu Phe Lau Asn Pro Leu Val Ila Lau Leu Ser Lau Asn Asn Ala Lau 
20 2S 30 



Pro Lau Val 
35 



Phe Kis Pra His Ser Glu Phe Leu Glu Asp Kis As- Arg 

40 45 



61 



Gly Asp Thr Leu Pro Ser lie Val Xaa 
50 55 



<2io> as 

<211> 43 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (43) 

<22 3> Xaa equals scop translation 
<40Q> 35 

Met Leu Val Ala Thr Ala Val Cys Cys Tyr Leu Fhe Trp Leu lie Ala 
15 10 15 

lie Leu Ala Gin Leu Asn Pro Leu Phe Gly Pro Gin Leu Lys Asn Glu 
20 25 30 

Thr . He Trp Tyr Val Arg Phe Leu Trp Glu Xaa 
35 * 40 



<2ia> as 

<211> 41 

<2I2> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (41) 

<223> Xaa equals stop translation 

<4qq> as 

Mec Leu Leu Leu Trp Ala Phe Ser Gly Val Cys Ala Val Pro Ala Arg 
15 10 15 

Ala Thr Pro Val Pro Ser Ser Phe Cys Pro Gin Gly Pro Ser Leu Cys 
20 25 30 

Pro Lys Gin Pro Ala Ser Leu Ala Xaa 
35 40 



<21Q> 87 

<211> 74 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (74) 

<223> Xaa equals stop translation 
<400> 37 

Met His Ala Tyr Ala Cys Val Cys Ala Cys Mec Leu Val Cys Val Cys 
1 5 10 15 
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Vai Cys Val Cys Arg Aia Lau Val lie Pro Thr Gla Gir. Arg His Arg 

20 25 30 

Arg Val Aia His Gly Arg Thr Ser Asp Ser Thr Lau Pro Cys Thr Val 

35 4Q 45 

Lys lie Trp Pro Ser Gla Arg Gly Asp Gly Arg Gly Glu Arg Gly Gla 

5Q 55 SO 





Arg Arg Arg Gly Thr Asp Trp 


Arg Gly 


Xaa 








S5 


70 












<21Q> 


83 












<211> 


47 












<212> 


PF.T 












-<213> 


Homo sapiens 












<220> 














<221> 


SITE 












<222> 


(34) 












<223> 


Xaa equals any of Che 


natural I 


y occurring 


L- amino 


acids 




<220> 














<221> 


SITE 












<222> 


(47) 










■LIU: 


<223> 


Xaa equals scop trans 1 


.atxon 










<4Q.Q> 


aa 












Met Kis His Pro Asr. Leu Cys 


Leu His 


Phe Kis Aia 


Aia Phe 


Ser Leu 




1 


5 




10 




15 



Cys Val His Gly Cys Leu Cys Val Gin Phe Phe Pro Phe Tyr Lys Asp 
20 25 30 

Thr Xaa His lis Gly Leu Glu Pro Thr Leu Met Thr Ser Ser Xaa 
35 40 45 



<210> 89 

<211> 63 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (S3) 

<223> Xaa equals stop translation 
<40Q> 89 

Met Leu Phe Leu Asn Val lie Leu Phe Ser Lau Thr Val Phe Thr Leu 
1 5 10 15 

He Ser Thr Ala His Thr Leu Asp Arg Ala Val Arg Ser Asp Trp Leu 
20 25 30 

Leu Leu Val Leu He Tyr Ala Cys Lau Glu Glu Leu lie Pro Glu Leu 

35 40 45 
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lie Phe Asn Leu Tyr Cys Gin Gly Asn Ala Thr Leu Phe Phe Xaa 
50 55 SO 



<210> 90 
<211> 70 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITS 
<222> (20) 

<223> Xaa equals any of the naturally occurring L- amino acids 
<220> 

<221> SITS 
<222> (70) 

<223> Xaa equals stop translation 

<400> 90 

Met Leu Leu Lys Leu His Thr Leu Trp Pro Leu Trp Pro Gly Leu Trp 
IS 10 15 

Ala Thr Thr Xaa Sar Asp Ser Leu Gly Glu Arg Thr His Sar Leu Cys 

20 25 30 

Arg Arg Lys Lys Ala Sar Leu Ser Thr Gly Trp Met Ser Trp Met Ser 
35 40 45 

Cys Arg Ala Arg Ala Thr His Thr Gin Val Val Ser Leu Lys Asp Lys 
50 SS SQ 

Val Glu Phe Ala Pro Xaa 
65 70 



<2iQ> 91 
<2il> 57 
<212> PRT 

<2I3> Homo sapiens 
<22Q> 

<221> SITE 
<222> (57) 

<223> Xaa equals scop translation 
<4Q0> 91 

Met Lys Glu Ser Arg Lys Met Leu Trp Val Phe Lys Met Leu Phe Phe 
15 10 15 

Lys lie Val Leu Trp Val Asn Leu Leu Ser Ala Ala Leu Ser Cys II a 
20 " 2S 30 

Gin Lys Gin Met Leu Gly lie Ala Pro Gin Lys Cys Val Pro Lys Leu 
35 40 ' 45 

Cys Phe Gin Leu Tyr lie Met Arg Xaa 
SO 55 
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<210> 92 

<211> S3 

<212> PRT 

<213> Eomo sapiens 

<22Q> 

<221> SITE 
<222> (68) 

<223> Xaa equals stop translation 
<400> 92 

Met Tyr Phe Lau Leu Ser Val Thr Ser Glu Ser Val Trp Arg Ser Trp 
1 " 5 10 IS 

Thr Lau Thr Phe Eis Ser Phe Ala lie Lau Ser Leu Arg Cys Trp Thr 
20 25 30 

Ser Leu Leu Leu Leu lie Pro Lau Thr Ser Cys Asn Phe Ser Ser Pro 

35 40 45 

Ser Trp Arg Met Thr Ala Ser Gin Val Pro Ser Lys Arg Lys Ala Ser 
SO 55 SO 

Men Thr Leu Xaa 
65 



<210> 93 

<211> 4S 

<212> PRT 

<213> Homo sapiens 

<22Q> 

<221> SITE 
<222> (45) 

<223> Xaa equals stop translation 
<4Q0> 93 

Met Lys Glv Trp Pro Val Phe Leu Leu Val Gin Ala Val Thr Phe Leu 
1 * * " 5 10 IS 

Ser Val Ala Gin Ser Gly Ala Met Ala Cys Ala Ala Ser Gly Val Val 
20 ' 25 30 

Tyr Ser Val Asp Val Pro Ala Cys Ser Ser Arg Ser Xaa 

35 40 45 



<21Q> 94 

<211> SS 

<212> PRT- 

<213> Kama sapiens 

<220> 

<221> SITS 
<222> (55) 

<223> Xaa equals stop translation 
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<40Q> 94 

Met Val Lau Ser Pro Trp Ala Cys Lau Phe Val Val Phe Phe Pro Tyr 
15 1Q 15 

Us Gin Sar Ser Leu Arg Ser Asp Lvs His Leu Gin Lau Sar Asa Xla 
2Q ' 25 30 

Lau Pro Thr Pro Ser His His lis His lau Pro Ala Sar 11a Cys Zla 

3= 40 45 

Gin Lau Arg Ala Gly Asn Xaa 
SO 5S 



<210> 95 

<211> 41 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (41) 

<223> Xaa equals stop translation 
<4Q0> 95 

Met Cys Giu Tyr Val Lau Leu Lau Tyr lie Val Lau Lau Cys Asn Arg 
1 5 10 IS 

Ser Tvr Ala Val Phe Thr Gin Cys Val Lau Arg Ser Ser Pro lie Asp 
20 2 5 3 0 



Sar Ser Arg Asn Ala Val Leu Leu Xaa 
35 40 



<210> 96 

<211> 41 

<212> PRT 

<213> Homo sapiens 

<220> 

<22i> SITE 
<222> (41) 

<223> Xaa equals stop translation 
<4Q0> 96 

Mec Thr Thr Pro Gly Leu Lau lie Lau Phe Lau Ala His Val Cys Lau 
1 5 10 15 

Val Asn His Gin Gin Ala Ala Giu Pro Gly Trp Lys Gin His Cys Cys 
20 25 30 

Asn TjT2 Giu Gly His Arg Val Leu Xaa 
35 40 



<21Q> 97 
<211> 50 
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<212> PRT 

<213> Homo sapiens 

<22G> 

<22I> SITE 
<222> (14) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (50) 

<223> Xaa equals step translation 
<4QQ> 97 

Met Leu Cys His Val T-^r Leu Leu Leu Val Gly His Ala Xaa Phe Ser 
1 5 10 15 

Leu Arg Cys Ser lie Asn Ser Ala 
25 3 0 

Trp Asn Ser Ser lie Cys Phe Asn 
45 



Val Gly Leu Met Gly Gin Arg Lys 
20 

Leu Arg Ser Ala Val Ser Ser Ala 

35 40 

Ser Xaa 
50 



<21Q> 9a 

<2ii> sa 

<212> PRT 

<213> Homo sapiens 

<220> 

<22L> SITE 
<222> (53) 

<223> Xaa equals stop cransla-ion 
<4Q0> 93 

Met Ser GIu Trp Cys Gin Pro Asp GLn lie Leu Leu Gin Phe Pro Val 
1 5 10 15 

Leu Ala Thr Met Ser Val Ala Phe Leu lie Gin Arg Cys Phe Cys Phe 
20 25 30 

Trp Trp Ph.e Val Leu Asn Ala Phe Ser lie Pro Ser Gly Thr GIu Lys 
3 5 40 45 

Lys Arg He Val Phe Lys Lys Trp Leu Xaa 
50 55 



<210> 99 

<211> 52 

<212> PRT 

<212> Homo sapiens 

<220> 

<221> SITE 
<222> (52) 
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<223> Xaa equals scop translation 
<400> 99 

Met Lys Val Val Val Val Met Val Val lis Leu Val Val Val Thr Leu 
L 5 10 15 

Val Val Val Val Met Val Val lie Leu Val Met Val Val Met Val Val 
20 25 30 

Ala Leu Val Thr Leu Thr Trp Gly ?rc Val Ala Val Thr Val Asp Ala 
35 40 45 

Gly Ser Trp Xaa 
50 



100 

45 

PRT 

Homo sapiens 



SITS 

(45) 

Xaa equals scop translation 



Phe Lau Arg Trp Leu Leu Thr Thr Phe Arg He Arg Ala 
5 10 15 

Ser Thr Pro Cys Trp Ser Pro Ser His Leu Gly Cys Leu 
20 " 25 30 

Lau Pro Arg Asp Lau Ser His Leu Giu Xaa 
40 45 



<210> 
<2il> 
<212> 
<213> 

<22Q> 
<221> 
<222> 
<223> 



<40Q> 100 
Mec Pro His 
1 



Ser Cys Gly 



Gin Pro Ala 
35 



<21Q> 101 

<211> 53 

<212> PRT 

<213> Eamo sapiens 

<220> 

<221> SITE 
<222> (53) 

<223> Xaa equals stop translation 
<400> 101 

Met Ser Thr Lys lia Leu Gin Phe Leu Phe Ser Ser Cys Cys Trp Val 
1 5 10 15 

Pro Pro Met Leu Phe Lau Phe Lys Asn Thr Lys Cys Arg Thr Ser Leu 
20 25 30 

Leu Tyr Cys Ptie Tyr Phe lie Lau Leu Thr Cys Ser Lau Ser Glu Tyr 
35 40 ' 45 

Asp Ser Leu Leu Ser Ser Lys Val Phe Xaa 
50 55 
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<21Q> 102 

<211> 41 

<212> PRT 

<2L3> Hcmo sapiens 

<220> 

<221> SITE 
<222> (41) 

<223> Xaa equals stop translation 
<400> 102 

Mec Pile Trp Phe Trp Phe Leu Leu Sar Lau Ser Phe Gin Gin Val Glu 
1 5 10 IS 

Gin Gin Gin Val Phe Gin Cys He Cys Cys Thr &rg Thr Lys Tyr Lys 
20 25 3 0 

Sar Val Trp His Gin Lys Ser Lys Xaa 

3S 40 



<210> 103 
<211> 143 
<212> PF.T 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (104) 

<223> Xaa equals any of Che naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (105) 

<223> Xaa equals any of the naturally occurring L-artino acids 
<22Q> 

<221> SITE 
<222> (115) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<22Q> 

<221> SITE 
<222> (143) 

<223> Xaa equals stop translation 
<400> 103 

Met Thr Leu lis Glu Val Leu Val Ser Val Leu lie Leu Ma Val Gly 
IS 10 15 

Leu Leu Arg Ala Ala Val lie Gin Leu Asn Ala Leu Lys Tyr Thr Asp 
20 25 30 

Ser Ser Arg Mec Thr Ser Gin Ala Ser Fhe lie Ala Tyr Asp Met Leu 
35 40 4S 

Asp Arg He Arg Ala Asn Ser Gly Ala Asp Tyr Ser Trp Gly Gin Gly 
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50 55 60 

Glu Arg Ala Pro Ser Thr Thr Ser Val Ala Ser Val Arg Asp Lau Asp 
65 70 75 30 

Lau His Asp Phe Glu Ala Asm. Ila Val Gly Phe Ala Gly Glu Ser Ala 
35 90 95 

Lys Gly Ser Val Ala Val Asn Xaa Xaa Glu Val Thr He Ser He Ser 
100 105 110 

Trp Asp Xaa Ser Arg Gly Ala Asn Ala Gin Gly Thr Arg Glu Thr Phe 
115 12 Q 125 

Thr Leu Thr Ser Arg Val Ala Val Asp Pro Arg Val Leu Pro Xaa 
130 135 140 



<210> 104 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<22Q> 

<221> SITE 
<222> (44] 

<223> Xaa equals stop trans la-ion 
<4Q0> 104 

Met Ala Phe Phe Phe Ala Leu Phe Val lie Phe Phe Val lie Val Val 
15 10 15 

Gin Met Glu Ser His Ser Gly Leu Gly Lys Lys Ser Lys lie Leu Ser 
20 25 30 

Gly Gly Gin Gly Glu Glu Val Tyr Phe Leu Asp Xaa 
35 * 40 



<2LQ> 


105 








<2il> 


S3 








<2L2> 


PRT 








<213> 


Homo sapiens 








<22Q> 










<22i> 


SITE 








<222> 


(63) 








<223> 


Xaa equals scop trans 1 


.ation 




<4Q0> 


105 








Met Tyr Phe Tyr Leu Ala 


Val 


Lys Pro 


Pro 


1 


5 






10 


Gin Val Ser Cys Arg Lau 


Ser 


Val Ser 


Leu 




20 




25 





io 



30 



His Lau Trp Ala Leu Phe Leu Phe Ser lie Leu Leu Gin Cys Arg Ala 
35 40 45 
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Arg Phe Leu Lau Leu Lau Val Leu Ser Gin Thr Gin Asp Leu Xaa 

50 55 60 



<21Q> 106 
<211> 233 
<212> PS.T 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (232) 

<223> Xaa equals soap cranslacicn 
<400> LOS 

Met Gly Ser Pro Gly MeC Val Leu Gly Leu Leu Val Gin lie Trp Ala 
1 " 5 10 15 

Leu Gin Glu Ala Ser Ser Leu Ser Val Gin Gin Gly Pro Asn Leu Leu 

2Q 2S 30 

Gin Val Arg Gin Gly Ser Gin Ala Thr Leu Val Cys Gin Val Asp Gin 
35 • 40 45 

Ala Thr Ala Trp Glu Arg Lau Arg Val Lys Trp Thr Lys Asp Gly Ala 
30 55 60 

lia Leu Cys Gin Pro Tyr lie Thr Asn Gly Ser Leu Ser Leu Gly Vai 
65 * 70 75 30 

Cys Gly Pro Gin Gly Arg Leu Sar Trp Gin Ala Pro Sar His Lau Thr 
85 90 95 

Leu Gin Leu Asp Pro Val Ser Lau Asn His Ser Gly Ala Tyr Val Cys 
1Q0 105 HQ 

Trp Ala Ala Val Glu Ha Pro Glu Leu Glu Glu Ala Glu Gly Asn He 
LIS 12 0 125 

Thr Arg Leu Phe Val Asp Pro Asp Asp Pro Thr Gin Asn Arg Asn Arg 
130 135 140 

lie Ala Ser Phe Pro Gly Phe Lau Phe Val Leu Leu Gly Val Gly Ser 
145 150 155 ISO 

Met Gly Val Ala Ala lie Val Trp Gly Ala Trp Phe Trp Gly Arg Arg 
1S5 170 175 

Ser Cys Gin Gin Arg Asp Ser Gly Asn Ser Pro Gly Asn Ala Phe Tyr 
130 135 19Q 

Ser Asn Val Leu Tyr Arg Pro Arg Gly Ala Pro Lys Lys Ser Glu Asp 
193 ' 20Q 205 

Cys Ser Gly Glu Gly Lys Asp Gin Arg Gly Gin Ser He Tyr Ser Thr 
210 ' " 215 220 

Ser Phe Pro Gin Pro Ala Pro Arg Gin Fro His Leu Ala Ser Arg Pro 
22S 230 235 240 
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Cys Pro Ser Pro Arc Pro Cys Pro Ser Pro Arg Pro Gly His Fro Val 

245 250 253 

Ser Met Val Arg Val Ser Pro Arg Pro Ser Pro Thr Gin Gin Pro Arg 

2SQ 265 270 

Pro Lys Gly Phe Pro Lys Val Gly Glu Glu Xaa 
275 230 



<210> 107 

<211> 93 

<212> PRT 

<2I3> Homo sapiens 

<220> 

<221> SITS 
<222> ( 98 ) 

<223> Xaa equals stop translation 
<400> 107 

Met Cvs Lys Lau Cvs Phe Tyr Leu Tyr Leu Cys Thr Trp Phe Pro Pha 
1 * ' "s 10 15 

Gly A.la Ser Gly Leu Phe Trp Asp Lys Trp Cys Lau Pro Arg Kis Leu 
20 25 30 

Pro Val Val Ser Gly Gin Glu Gin Leu Ser Ser Ser Lau Pro Ala Ala 
35 ' 40 45 

Leu Leu Phe Lau Gly Arg A.rg Trp Arg Pro Pro Leu Arg Val Ser Pro 
50 ' 55 SO 

Gly Leu Ser Phe Arg Gly Gly Arg Ala Gly Glu Fro Gin Gly Trp Gly 
65 ~ 70 75 SO 

■ As-o Ser Irj Glu Mec Glu Vai Ala Pro Ala Pro Lau Asp Gin Tyr Trp 
85 90 9S 

Leu Xaa 



<210> 10.8 

<2il> 62 

<2I2> PRT 

<213> Homo sapiens 

<22Q> 

<221> SITE 

<222> (62) 

<223> Xaa equals scop translation 

<40Q> 108 

Met Cys Lau Leu Leu Leu Trrj Leu Thr Thr Phe Gin Arg Thr Ser Gly 
L * 5 10 15 

Ala Leu Arg Arg Gly Gly Leu Ser Ser Fro Ala Trp Ala Met Arg Ser 
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2 0 25 3.0 

Pro Ser Val Tyr Ser Thr Gin Thr Pro Ser Pro Met Met: Ser Thr Gly 
3 5 40 45 

Thr Leu Arg Gly Lau Ser Gly Ala Met Cys Asn Lau Ser Xaa 
50 55 60 



<2I0> 109 

<211> 47 

<212> PRT 

<213> Eomo sapiens 

<220> 

<221> SITS 
<222> (47) 

<223> Xaa equals stop translation 



<4Q0> 109 

Met Lys Leu Cys Lys- Leu Thr Gin Cys Ser Phe Leu Leu Lys Ser Leu "' 
15 10 15 

lie Leu Leu Leu Glu Gin Lau Asn Val Ser Met Gly Phe Val Ala Ala 
20 25 30 

Fhe Asp Val Leu Val Gly Cys Ser He Cys Phe Glu Lys E-s Xaa 

35 40 45 



<210> 110 

<211> 47 

<212> PRT 

<213> Homo sapiens 

<22Q> 

<221> SITE 
<222> (47) 

<223> Xaa equals snop translation 



<400> 110 

Met Thr Thr Phe Ser Leu Cys Ser 
1 5 

Thr Sar Lau Val Sar Leu Pro Pro 

20 

Pro Val Gly Pro Glu Fro His Cys 

3 5 40 



Gin Leu Ala Leu Leu Cys Ala Cys 

10 15 

Phe Val Asp Tyr Lys Asp Thr Sar 
25 30 

Lys Gly Leu He Leu Thr Xaa 
45 



<210> 111 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<220> 

<22i> SITE 
<222> (32) 
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<223> Xaa equals any o£ Che naturally occurring L-amino acids 

<22Q> 

<22I> SITS 
<222> (33) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (41) 

<223> Xaa equals any of Che naturally occurring L-amino acids 
<22Q> 

<221> SITS 
<222> (42) 

<223> Xaa equals stop translation 
<400> 111 

Met Asn He Leu Val Cys Val Phe Trp Leu Trp Gly Gly Val Ala Gly 
15 10 13 

Ser ?m Giv Ara Kis lis Phe He Phe Thr Ser Val Lys Asn Val Xaa 
20 25 30 

Xaa Ala Ser Kl3 Cys Ala Trp Pro Xaa Xaa 
35 40 



<21Q> 112 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 112 

Met Gly Glv He Ala Leu Fro Ser Leu Ser Leu Cys Leu Leu Ser Ala 
1 5 10 15 

Gly Ser His Cys He Ser Pra Ala Asp Gin Glu Thr Gly Pro Lys Val 
20 25 30 

Thr Ala Pro Gin Gly Asn Phe Leu Pro 

35 40 



<210> 113 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (44) 

<223> Xaa equals' scop translation 
<40G> 113 

Met He Val Leu Lys Trp lie Phe Leu Ala Cys Val His Glu Cys Mec 
1 "5 10 15 

Cys Lys Pro Leu Lys Cys Phe Lau Glu Lys He Leu Glu Val Leu lie 
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2G 



25 



Met Val Lys Leu Lys Met Gly Val Leu Pro Ala Xaa 

35 40 



<210> 114 
<211> 132 
<212> PRT 
<213> Homo sapiens 

<4Q0> 114 

Met Giu era Gin Leu Gly Pro Glu Ala Ala Ala Lau Arg Pro Gly Trp 
IS 10 15 

Leu Ala Lau Lau Leu Trp Val Sar Ala Lau Ser Cys 3er Phe Ser Lau 

20 25 3Q 

Pro Ala Ser Ser Lau Ser Ser Lau Val Pro Gin Val Arg Thr Ser Tyr 
35 40 45 

Asn Phe Gly Arg Thr Phe Lau Gly Lau Asp Lys Cys Asn Ala Cys lie 
50 55 SO 

Gly Thr Sar lie Cys Lys Lys Phe Phe Lys Giu Glu lie Arg Ser Asp 
.65 ' 70 75 80 

Asn Trp Leu Ala Ser Kis Lau Gly Leu Pro Pro Asp Ser Lau Leu Ser 
35 30 35 

Tyr Pro Ala Asn Tyr Ser Asp Asp Sar Lys lie Trp Arg Pro Val Glu 

10 Q 10 5 110 

He Pha Arg Lau Val Ser Lys Tyr Gin Asn Glu He Ser Asp Arg Lys 
115 120 125 

lis Cys Ala Ser Ala Ser Ala Pro Lys Thr Cys Ser lie Glu Arg Val 
130 13S 140 

Leu Arg Lys Thr Glu Arg Phe Gin Lys Trp Leu Gin Ala Lys Arg Leu 
145 150 155 ISO 

Thr Pro Asp Lau Val Gin Asp Cys His Gin Gly Gin Arg Giu Lau Lys 
1S5 170 175 

Phe Lau Cys Mec Leu Arg 

130 



<210> 115 

<2ii> ai 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITS 
<222> (31) 

<223> Xaa equals stop trans lacion 



75 



<4Q0> 115 

Met: Ala Leu Gly Ser 31ec Tyr Leu Val Leu Thr Leu lie Val Ala Lys 
1 " 5 10 15 

Val Leu Arg Gly Ala Glu Pro Cys Cys Gly Pro Leu Lys Asn Arg Val 
20 2S 30 

Leu Arg Pro Cys Pro Leu Pro Val His Cys Pro Lau Pro Lie Pro Ser 
35 40 45 

o ro jiia Glu Glv lie Pro Tro Val Ala Tyr Leu Pro lie Arg Trp Pile 
50 55 SQ 

Tie Ser- Cvs Cys Pro Glv His Cys He Gin lie Pro KeC Cys Thr Ser 
S5 70 75 90 

Xaa 



<21Q> US 

<21i> 49 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITS 
<222> (49) 

<223> Xaa equals scoa cranslacion 
<4QQ> 115 

Mac Ser Cys Glu Asa Asn Leu Lys Lys Lys Asn Thr Thr Leu Leu Ser 
1 " S 10 15 

Tyr Leu He £>he Leu Ala Leu Val Met Tyr Leu Thr Pha Mec Phe Leu 
20 25 30 

Ser Ser Val Ser Thr Ser Arg lie Ser Leu Ser Asn. Ser Met lie He 
35 40 45 

Xaa 



<210> 117 
<211> 204 
<212> PRT 

<213> Homo sapiens 
<22Q> 

<221> SITS 

<222> (31) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<22Q> 

<22i> SITS 
<222> (93) 

<223> Xaa equals any of the naturally occurring L-amino acids 
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<220> 

<22I> SITE 
<222> (99) 

<223> ICaa equals any of trie naturally occurring L-amino acids 



<220> 
<221> SITE 
<222> (115) 

<223> ICaa equals any of the naturally occurring L-amino acids 



<22Q> 
<221> SITE 
<222> (15L) 

<223> ICaa equals any of the naturally occurring L-aiaino acids 



<220> 
<221> SITE 
<222> (204) 

<223> ICaa equals stop translation 



<400> 117 

Mec Val Gly leu Met His lie Gly Phe Gly lie Vai Leu Cys Leu He 
i S 10 15 

Ser Phe Ser Phe Arg Giu Val Leu Gly Phe Ala Ser Thr Ala Xaa He 

20 * 25 -30 

Gly Gly Tyr Pro Phe Trp Gly Gly Leu Ser Phe lie lie Ser Gly Ser 
"35 40 45 

Leu Sar Val Ser Ala Ser Lys Glu Leu Ser Arg Cys Leu Val Lys Gly 
50 55 SO 

Ser Leu Gly Mec Asn He Gly Arg Ser He Leu Ala Phe He Gly Val 
65 " 73 75 30 

lie Leu Leu Leu Val Asp Met Cys He Asn Gly Vai ICaa Gly Gin Asp 
85 * 90 95 

Tyr Trr> Xaa Val Leu Ser Gly Lys Gly lie Ser Ala Thr Leu Met He 
iaa 105 HQ 

Phe Ser Xaa Leu Glu Phe Phe Val Ala Cys Ala Thr Ala His Phe Ala 

115 120 125 

Asn Gin Ala Asn Thr Thr Thr Asn Met Ser Val Leu Val He Pro Asn 
130 135 14Q 



Met Tyr Giu Ser Asn Pro Xaa Thr Pro Ala Ser Ser Ser Ala Pro Pro 

145 ' ISO 155 ISO 

Arc Cvs Asn Asn Tyr Ser Ala Asn Ala Pro Lys Arg Lys Arg Gly He 

155 - 170 175 



Ser Leu Ha Ser Trp Arg Lys Thr Thr Cys Lys Asn Phe Leu Arg Arg 
130 ' 135 130 



Cys Leu Leu Leu Ser Thr Mec He Ser Ser 
195 200 



Leu ICaa 
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<21Q> 113 

<211> 19 

<212> PRT 

<213> Hcmo sapiens 

<220> 

<221> SITE 
<222> (19) 

<223> Xaa equals scop translation 
<400> na 

Ser Leu Ksv Ala Pb.e Arg Leu lie Arg Ala Mec Gly Ala Thr Gly Lau 

i " s ia is 

Ser Phe Xaa 



<21Q> 119 

<211> 13 

<212> PRT 

<213> Hcmo sapiens 

<220> 

<221> SITE 

<222> (13) 

<2 23> ICaa actuals scop translation 

<4Q0> 119 

Leu Val Lau Trp He Val Met Leu Thr Tyr Ala Thr Xaa 
1 " 5 10 



<210> 


120 


<21i> 


ao 


<212> 


PRT 


<213> 


Homo 


<220> 




<221> 


SITE 


<222> 


oo ) 


<223> 


Xaa ■ 


<4Q0> 


12Q 


Met Glu Pr 


1 





5 10 IS 

Leu Ala Leu Leu Leu to Val Ser Ala Lau Ser Cys Ser Phe Ser Leu 

20 25 30 

Pro Ala Ser Ser Leu Ser Ser Leu Val Pro Gin Val Arg Thr Ser Tyr 
35 40 45 

Asn Phe Gly Arg Thr Phe Leu Gly Leu Asp Lys Cys Asn Ala Cys He 
50 " " 55 60 



Gly Thr Ser He Cys Lys Lys Phe Pne Lys Glu Arg Asn Lys lie Xaa 
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65 70 75 30 



<2I0> 121 
<211> 146 
<212> PRT 
<213> Homo sapiens 

<22Q> 

<221> SITS 
<222> (96) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITS 
<222> (107) 

<223> Xaa equals any of the naturally occurring L-amino acids. 
<22G> 

<221> SITE 
<222> (111) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<22Q> 

<221> SITS 
<222> (115) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<22Q> 

<221> SITS 
<222> (122) 

<223> Xaa equals any of the naturally occurring L-ami.no acids 
<220> 

<221> SITE 
<222> (132) 

<223> Xaa equals any of Che naturally occurring L-amino acids 
<4GQ> 121 

Med Glu Pro Gin Lau Gly Pro Glu Ala Ala Ala Leu Arg Pro Gly Trp 
IS 10 IS 

Leu Ala Lau Lau Leu Trp Val Ser Ala Leu Sar Cys Ser Phe Ser Lau 
20 25 30 

Pro Ala Ser Ser Lau Ser Ser Lau Val Fro Gin Val Arg Thr Ser Tyr 
35 40 45 

Asn Phe Gly Arg Thr Phe Leu Gly Lau Asp Lys Cys Asn Ala Cys lie 

SO S3 SO 

Gly Thr Ser He Cys Lys Lys Phe Phe Lys Glu Glu lie Arg Ser Asp 
65 70 75 30 



Asn Trp Lau Ala Ser His Leu Gly Thr Ala Ser Arg Phe Pro Lau Xaa 
35 90 95 



79 



Ser Tyr Pro Cys lys Lau Lau Gin 

100 

Cys Gly Xaa Lau Leu Thr Gly Gin 
115 12 0 

Gin Glu lie Xaa Cys Lau Lau Sis 
13 Q 13 5 

Asp Val 
145 



Mec lie Xaa Lys He Trp Xaa Pro 
10 5 110 

Gin Xaa Ser Asn Glu lie Ser Lys 
12 5 

Pro Pro Pro Lys Asn Lau His Ila 

140 



<2IQ> 122 

<2ii> ai 

<212> PRT 

<213> Homo sapiens 
<22Q> 

<221> SITE 
<222> (81) 

<223> Xaa equals soap translation 
<400> 122 

Met Ala Lau Glv Ser Mec Tyr Lau Val Lau Thr Leu lie Val Ala Lys 
1 5 10 IS 

Val Lau Arg Gly Ala Glu Pro Cys Cys Gly Pro Lau Lys Asn Arc Val 
20 25 30 

Leu Arg Pro Cys Pro Leu Pro Val His Cys Pro Leu Pro lie Pro Ser 
35 * 40 45 

Pro Ala Glu Gly lie Pro Trp Val Ala Tyr Leu Pro lie Arg Trp Phe 
50 " 55 60 

lie Ser Cys Cys Pro Gly His Cys lie Gin Ila Pro Met Cys Thr Ser 
55 " 7Q 75 aa 

Xaa 



<21Q> 123 
<21I> 337 
<212> PRT 

<212> Homo sapiens 
<40Q> 123 

Glu Pro His Arg Gly Pro His Lau 
1 5 

Gly Gin Arg Pro Gly Lau Gin Asn 
20 



Pro Pro Asp Lau Gly His His His 

10 ' 15 

Ila Asn Val Phe Leu Arg Asn Thr 

25 30 



Val Lys Val Thr Gly Val Val Val Phe Met Phe Ser Lau Ser Trp Gin 
3 5 40 45 



30 



Leu. Ser Leu Val Thr Phe Met Gly Phe Fro He He Met Met Val Ser 

50 55 SO 

Asn He Tyr Gly Lys Tyr Tyr Lys Arc Leu Ser Lys Glu Val Gin Asn 
S5 " " " 7Q 75 80 

Ala Leu Ala Arg Ala Ser Asn Thr Ala Glu Glu Thr He Ser Ala Mec 
85 90 95 

Lys Thr Val Arg Ser Phe Ala Asn Glu Glu Glu Glu Ala Glu Val Tyr 
1QQ 105 110 

Leu Arg Lvs Leu Gin Gin Val Tyr Lys Leu Asn Arg Lys Glu Ala Ala 
lis 12 0 125 

Ala Tvr Met Tvr Tyr Val Trp Gly Ser Gly Leu Thr Leu Leu Val Val 
'13 0 " 13 5 - 140 

Gin Val Ser He Leu Tyr Tyr Gly Gly His Leu Val He Ser Gly Gin 
145 150 155 ISO _ 

Met Thr Ser Glv Asn Leu He Ala Phe He He Tyr Glu Phe Val Leu 
1S5 170 . 175 

Gly As-o Cys Met Glu Asn Val Ser Phe Ser Leu Ser Pro Gly Lys Val 

130 135 190 

Thr Ala Leu Val Gly Pro Ser Gly Ser Gly Lys Ser Ser Cys Val Asn 
log 200 205 

He Leu Glu Asn Phe Tyr Pro Leu Glu Gly Gly Arg Val Leu Leu Asp 
. 210 " 215 220 

Glv Lys Pro He Ser Ala Tyr Asp His Lys Tyr Leu Kis Arg Val He 
225 230 235 240 

Ser Leu Val Ser Gin Glu Pro Val Leu Phe Ala Arg Ser lie Thr Asp 
245 250 25S 

Asn He Ser Tyr Gly Leu Pro Thr Val Pro Phe Glu Met Val Val Glu 
260 265 270 

Ala Ala Gin Lvs Ala Asn Ala His Gly Phe He Mec Glu Leu Gin Asp 
275 " 230 . 235 

Gly Tyr Ser Thr Glu Thr Gly Glu Lys Gly Ala Gin Lau Ser Gly Gly 

290 295 300 

Gin Lys Gin Arg Val Ala Trp Pro Gly Leu Trp Cys Gly Thr Pro Gin 
305 ~ 310 315 320 

Ser Ser Ser Tro Mec Lys Pro Pro Ala Leu Trp Met Pro Arg Ala Ser 
3 25 330 335 

He 



<210> 124 



31 



<211> 315 
<2I2> PRT 
<213> Homo sapiens 

<4GQ> 124 

Met Ser Ser Ala Thr Trp Tor Ala Ala Ser Trp Arg Thr Ser Ala Thr 
15 10 15 

Ser Thr Ser Leu Thr Arg Cys Trp lie Ser Giy Gin Pro Ala Cys Thr 

20 25 30 

Ala Ala Ala Cys Cys Trp Gly Ala Thr lie Giy Val Ala Lys Asn Ser 

35 40 45 

'Ala Leu Gly Pro Arg Arg Leu Arg Ala Ser Trp Leu Val lie Thr Leu 

50 55 SO 

Val Cys Lau Phe Val Gly lie Tyr Ala KeC Val Lys Leu Leu Leu ?he 
65 " "Q 75 80 

Ser Glu Val Arg Arg Pro lie Arg Asp Pro Trp Phe Trp Ala Lau p'he 
ai 90 95 

Val Tro Thr Tyr lie Sar Lau Gly Ala Ser Phe Leu Lau Trp Trp Lau 
100 10 5 110 

Leu Ser Thr Val Arg Pro Gly Thr Gin Ala Leu Glu Pro Giy Ala Ala 
LIS 120 125 

Thr Glu Ala Glu Gly Phe Pro Gly Sar Giy Arg Pro Pro Fro Glu Gin 
13 0 " 13 5 140 

Ala Ser Giy Ala Thr Leu Gin Lys Lau Leu Ser Tyr Thr Lys Pro Asp 

145 150 155 ISO 

Val Ala Phe Leu Val Ala Ala Ser Phe Phe Leu lie Val Ala Ala Lau 
165 170 175 

Giv Glu Thr Phe Lau Pro Tyr Tyr Thr Giy Arg Ala lie Asp Gly lie 
ISO 185 190 

Val lie Gin Lys Ser MeC Asp Gin Phe Ser Thr Ala Val Val Cle Val 
195 " 200 205 

Cys Leu Lau Ala lie Gly Ser Ser Phe Ala Ala Gly lie Arg Giy Giy 
21Q 215 220 

lie Phe Thr Leu lie Phe Ala Arg Leu Asn lie Arg Leu Arg Asn Cys 
225 230 235 24Q 

Leu Phe Arg Ser Leu Val Ser Gin Glu Thr Sar Phe Phe Asp Glu Asn 
245 250 255 

Arg Thr Giy Asp Leu lie Sar Arg Lau Thr Ser Asp Thr Thr Mec Val 
260 255 270 

Ser Asp Leu Val Ser Arg Thr Ser HeC Ser Ser Cys Gly Thr Gin Ser 
275 230 235 



S2 



Arg Ser Arg Ala Trp Trp Ser Ser Cys Ser Ala Ser His Gly Ser Ser 

23G 295 300 

Pro Trp Ser Pro Ser Trp Ala Ser Pro Ser Ser 
3 05 " 3LQ 315 



<21Q> 125 
<211> 167 
<2X2> PR.T 
<213> Homo sacians 



<40Q> 125 

His Leu Leu Arg Pro Ala His Cys 
i " 5 

Arg Thr Giu Gly Gin Cys Pro Arg 

20 



Ala Phe Arg Asp Gly Gly Gly Gly 
10 15 

Lau His His Gly Thr Pro Gly Arg 

25 30 



Leu Gin His Arg Asp Arg Gly Glu Gly Arg -Pro Ala Val Arg Trp Pro 

35 40 45 

Glu Ala Ala Gly Gly Met Ala Arg Ala Leu Val Arg Asn Pro Pro Val 

50 " ' 55 SO 



Leu He Leu Asp Glu Ala Thr Ser Ala Lau Asp Ala Glu Ser Glu Tyr 
6 5 " 70 75 8 0 

Lau He Gin Gin Ala lie His Gly Asn Leu Gin Lys His Thr Val Lau 
35 90 95 

lie He Ala His Arg Lau Ser Thr Val Glu His Ala His Lau Ha Val 
100 ' 105 HQ 

Val Leu Aso lys Glv Arg Val Val Gin Gin Gly Thr His Gin Gin Leu 
115 " 120 125 

Leu Ala Gin Gly Gly Leu Tyr Ala Lys Leu Val Gin Arg Gin Hat Leu 
13 0 ' 13 5 140 

Glv Leu Gin Pro Ala Ala Asp Phe Thr Ala Gly His Asn Glu Pro Val 
145 ISO 155 ISO 

Ala Asn Gly Ser His Lys Ala 
1S5 



<21Q> 12S 
<211> 227 
<2L2> PRT 
<213> Homo sapiens 

<22Q> 

<221> SITE 
<222> (71) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<4Q0> 12 S 

Arg Lau Thr Lys Thr He Ser Phe Ser Lau Gin Asn Gin Thr Ala Phe 
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IS 10 15 

lie Asn Ser leu Ala Lys Thr Pro Tyr Gin Ala Leu Thr Gly Ala Ala 

20 25 30 

Leu Ala Gly Ser Tyr Pro lie Trp Glu Asn Glu Asn Thr Leu Ser Trp 
35 40 45 

Tyr Leu Pro Ser Pro Thr Thr Leu Leu Ser Pro Pro Val Leu Phe Cys 
50 55 SO 

Val He Gin Leu lie Phe Xaa Leu Pro Ala Asn Trp Ser Gly Thr Cys 
65 70 75 SO 

Thr Leu Val Phe Gin Ala Pro Thr He Asn lie Leu Pro Pro Asn Gin 
35 9Q 95 

Thr lie Leu lie Ser Val Glu Ala Ser lie Ser Ser Ser Pro lie Arg 
1QQ 10S 110 

Asn Lys Trp Ala Leu His Leu He Thr Leu Leu Thr Gly Leu Gly lie ' 
LIS 12 Q 12 5 

Thr Ala Ala Leu Gly Thr Gly Ha Ala Gly lie Thr Thr Ser lie Thr 
130 135 140 

Ser Tyr Gin Thr Leu Phe Thr Thr Leu Ser Asn Thr Val Glu Asp Met 
145 ISO 155 ISO 

His Thr Ser lie Thr Ser Leu Gin Arg Gin Leu Asp Phe Leu Val Gly 
1S5 170 175 

Val He Leu Gin Asn Trp Arg Val Leu Asp Leu Leu Thr Thr Giu Lys 
130 135 190 

Gly Gly Thr Cys He Tyr Leu Gin Giu Glu Cys Cys Phe Cys Val Asn 
19S 200 205 

Giu Ser Gly He Val His He Ala Val Arg Arg Leu His Asp Arg Ala 

210 215 220 

Ala Glu Leu 
225 



<21Q> 127 

<211> 29 

<212> PRT 

<213> Homo sapiens 

<40Q> 127 

Tyr Pro He Trp Glu Asn Glu Asn Thr Leu Ser Trp Tyr Leu Pro Ser 
15 10 15 

Pro Thr Thr Leu Leu Ser Pro Pro Val Leu Phe Cys Val 

20 25 



<210> 123 



34 



<211> 27 
<212> PRT 

.<2U> Homo sapians 
<4GQ> 123 

Arg Val Lau Asp Leu Leu Thr Thr GIu Lys Gly Gly Thr Cys lie Tyr 
IS 10 15 

Leu Gin Glu Glu Cvs Cys Phe Cys Val Asia Glu 
20 25 



<210> 129 

<211> 10 

<212> PRC 

<213> Home sapians 

<400> 129 

Phe Ser Leu Gly Arg Arg His Cys Lau Gly 
1 " 5 10 



<2I0> 13Q 
<211> 123 
<212> PRT 
<213> Homo sapiens 

<22Q> 

<221> SITE 
<222> (64) 

<223> Xaa equals any cf the naturally occurring L-amino acids 

<220> 
<22I> SITE 
<222> (33) 

<223> ICaa equals any of the naturally occurring L-aminc acids 
<400> 130 

Glu His Pro Thr Ala Asp Arg Ala Gly Cys Ser Ala Ser Gly Ala Cys 
15 10 15 

Tyr ser Leu His His Ala Thr Met Lys Arg Gin Ala Ala Glu Glu Ala 
20 25 30 

Cys lie Leu Arg Gly Gly Ala Leu Ser Thr Val Arg Ala Gly Ala Glu 
35 40 45 

Leu Arg Ala Val Lau Ala Leu Leu Arg Ala Gly Pro Gly Pro Gly Zaa 
50 55 SO 

Gly Ser Lys Asp Leu leu Phe Trp Val Ala Leu Glu Arg Arg Arg Ser 
65 70 75 ao 

His Cys Xaa Lau Glu Asn Glu Pro Leu Arg Gly Phe Ser Trp Leu Ser 
85 90 95 

Ser Asp Pro Gly Gly Leu Glu Ser Asp Thr Lau Gin Trp Val Glu Glu 
100 105 110 



35 



Pro Gin Arg Ser Cys 'Tor Ala Arg Arg Trp Val 
115 " 120 



<210> 131 
<211> 344 

<212> ppt 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (19) 

<223> Zaa equals any of che naturally occurring L-aminc acids 
<40Q> 131 

Sar Arg Pro Fro Val Gly Ser Sar Pro Gin leu Glu Gly Asp Ala Mac 
1 5 " 1Q 15 

Pro Pro Xaa Arg Gin Arg Tyr Leu Cys Lys Tyr Gin Phe Glu Val Leu 
2G 25 30 

Cys Pro Ala Pro Arg Pro Gly Ala Ala Sar Asn Leu Sar Tyr Arg Ala 
3 5 ' 40 45 

Pro Phe Gin Lau His Ser Ala Ala Lau Asp Pha Sar Pro Pro Gly Thr 
50 55 SO 

Glu Val Sar Ala Lau Cys Arg Gly Gin Lau Pro lie Sar Val Thr Cys 
65 70 75 SO 

He Ala Aso Glu lie Gly Ala Arg Trp Asp Lys Lau Ser Gly Asp Val 
35 90 95 

Leu Cys Pro Cys Pro Gly Arg Tyr Leu Arg Ala Gly Lys Cys Ala Glu 
100 105 110 

Leu Pro Asn Cys Leu Aso Aso Leu Gly Gly Phe Ala Cys Glu Cys Ala 
115 I2G 125 

Thr Gly Phe Glu Leu Gly Lys Asp Gly Arg Ser Cys Val Thr Ser Gly 
130 " 135 140 

Glu Gly Gin Pro Thr Leu Gly Gly Thr Gly Val Pro Thr Arg Arg Pro 
145 " 150 155 160 

Pro Ala Thr Ala Thr Ser Pro Val Pro Gin Arg Thr Trp Pro He Arg 
. . 165 170 175 

Val Asp Glu Lys Leu Gly Glu Thr Pro Leu Val Pro Glu Gin Asp Asn 
lio 135 ISO 

Ser Val Thr Ser He Pro Glu lie Pro Arg Trp Gly Ser Gin Ser Thr 

195 200 205 

Mec Ser Thr Leu Gin Mac Ser Leu Gin Ala Glu Ser Lys Ala Thr Ila 
210 215 220 

Thr Pro Ser Gly Ser Val lie Ser Lys Phe Asn Ser Thr Thr Ser Ser 
225 230 235 240 



86 



Ala Thr Fro Gin Ala Phe Asp Ser Ser Ser Ala Val Val Phe lis Phe 
243 250 255 

Val Ser Thr Ala Val Val Val Leu Val lie Leu Thr Mac Thr Val Lau 
260 265 270 

Glv Leu Val Lvs Lau Cys Phe His Giu Ser Pro Ser Ser Gin Pro Arg 
275 " " 28Q 285 

Lys Glu Ser Men Giy Pro Pro Gly Trp Arg Val Us Lau Lys Pro Ala 
290 295 300 

Ala Leu Glv Ser Ser Ser Ala His Cys Thr Asn Asn Gly Val Lys Val 
3Q5 ' 210 315 320 

Gly Asp Cys Asp Leu Arg Asp Arg Ala Glu Giy Ala Leu Leu Ala Glu 
325 330 335 

Ser Pro Leu Gly Ser Ser Asp Ala 

340 



<210> 132 
<211> 7 
<212> P5.T 

<213> Homo sapiens 
<4QQ> 132 

Arg Tvr Leu Thr Leu Thr His 
1 " 5 



<21Q> 133 
<2il> S 
<212> PP.T 

<213> Homo sapiens 
<400> 133 

Cys Asn Thr Pro Trp Ala 
1 5 



<21Q> 134 
<21i> 3 
<2L2> P9.T 

<213> Homo sapiens 
<4QQ> 134 

Ala Pro Val lie Phe Ser His Ser 
1 5 



<210> 125 
<211> S 
<212> PRT 

<213> Homo sapiens 



<400> 13S 



S7 



Arg Asn Val Pro Asp Asp 
1 5 



<2IQ> 13 5 

<211> 6 

<212> P5.T 

<213> Hcmo sapiens 

<400> 135 

Gly Leu Gla Asn Val Ser 
1 S 



<21Q> 137 

<211> 23 

<212> PRT 

<212> Homo sapiens 



Gly His Ser Leu Asp Asn Ser Leu Ser lie Lau Arg Thr 
S 10 15 

Phe Tyr MeC Leu Gly Val Arg 
20 



<21Q> 133 

<21i> S 

<212> PRT 

<213> Homo sapiens 

<400> 133 

Val Giu Gly Gly His Sar 
1 S 



<4QQ> 137 
Val Giu Gly 
1 



<210> 139 
<2L1> 190 
<212> PRT 
<213> Homo sapiens 

<40G> 139 

Thr Tro Lau Arg Lau Gly Sar Sar Gin lie Trp Lau Gly Tnr Ala Pro 
1 5 10 15 

Arg Giv Pro Arcr He His Pro Giu Gin Ala Gly Lau Ala Gly Ala Pro 
20 25 30 

Val Lys Sar Thr Ser Ser Giu Giu Sar Gin Pro Gly Gly Gin Cys Gin 
35 40 45 

Sar Sar Gly Gly Ala Gin Tnr Lau Pro Ser Leu Arg Ala Ala Pro Val 
50 55 SO 

Ala Ala Leu Gly Ser Lau Sar Ser Tyr Pro Asp Ser Cys Pro Arg Ala 
65 " 70 75 SO 

Thr Thr Pro Giu Leu Cys Pro Gly Ala Pro Thr Lau His Leu Ala Asp 



S3 



35 90 95 

Ser lis Ser Gly Pro Val Ser Pro Pro Gly Ser Ser Lau Gly Pro Asp 

LQQ 105 HQ 

Ala Trp Thr Lau Cys Ala Lys His His Gin Ala Lys Gly Mac Thr Lau 
115 120 125 

Gly Thr Pro Lys Val Lau Arg Lau Gin Pro Val Sar Pro Cys Trp Gly 
13 0 13 5 140 

Pro Lys Sar Trp Arg Val Pro Gly Pro Phe Gin Pro Gly Arg- Arg Arg 
145 ' 150 155 ISO 

Gly Glu Sar Arg Gin Gin Gly Arg Gly Lys Arg Arg Ser Ala Arg Sar 
L65 170 175 

Ala Gin Sar Pro Thr Gly Pro Glu Ser Ala Ala Trp Pro Cys 

13Q 135 130 



<210> 140 

<211> 129 

<212> PRT 

<213> Korao sapiens 

<400> 140 

Thr Val Ala Thr Ala Cys Val Trp Ala Ala Cys Thr Gly Cys Trp Ala 
15 10 15 

Arg Pro Pro Val Pro Thr Trp Ala Gly Cys Ala Ala Arg Cys Ala Ala 
20 25 3 0 

Glu Asp Ala Arg Ala Gly Val Gly Asp Leu Pro Ala Thr Gly Gly Ala 
35 40 45 

Ala Thr Gly Arg Arg Ala Leu Thr Pro Ala Pro Pro Arg Gly Pro Cys 
SO " 55 60 

lie Lau Ser Pro Gin Pro Trp Ala Leu Gly Leu Pro Gly Ala Pro Leu 
S5 70 75 , 30 

Pro Ala Ala Leu Pro Gly Arg Ala Arg Gly Arg Pro Gly Leu Pro Ala 
as 90 95 

Lau Pro Ala Leu Ser Thr Leu Pro Gly Cys Pro Ala Leu Asp Pro Ala 

10Q 105 110 

Gly Ala Gly Thr Leu Cys Pro Pro Pro Gly Ala Ala Glu Pro Ala Gly 
115 120 125 

Pro 



<2LQ> 141 

<211> 90 

<212> PRT 

<213> Homo sapiens 



89 



<4Q0> L41 

Arg Ser Gly Gin Pro Gly Glu Glv Ser Met Leu Arg Lys Phe Ser Leu 
1 5 10 " 15 

Gin Arg Leu Leu Ser Pro Leu Asp Glu Ala Glu Thr Arg Tro Glv Leu 
20 25 ' 2Q 

Ala Leu Ala Cys Val Ala Gly Asp Lys Gly Pro Pro Arg Pro Tro Asn 
35 40 45 

lie Ser Ser Ala Fro Ala His Pro His Val Thr Thr Pro Glv Me- Glu 
50 55 SO 

Thr Ser Gly Gly Pro Ala Arg Asp Gly Gly Leu lie Leu Glu Arg Glu 
S5 70 75 ' 30 

Ala Ala Phe Asn Lys Pro Ala Pro Gly Glu 
35 90 



142 
307 
PP.T 

Homo sapiens 



SITS 
(13 5) 

Xaa equals any of Che ca cur ally occurring L-amino acids 



SITE 
(157) 

Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (203) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITS 
<222> (219) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITS 
<222> (255) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 142 

Arg Cys Gin Arg Asn Lys Asp lie Met Met Ser Ser Lys Pro Thr Ser 
15 10 15 

His Ala Glu Val Asn Glu Thr lie Pro Asn Pro Tyr Pro Pro Ser Ser 
20 25 " 30 

Phe Met Ala Pro Gly Phe Gin Gin Pro Lau Gly Ser lie Asn Leu Glu 



<210> 
<211> 
<212> 
<213> 



<220> 
<22L> 
<222> 
<223> 



<220> 
<22L> 
<222> 
<223> 



90 



3 5 40 45 

Asn Gin Ala Gin Gly Ala Gin Arg Ala Gin Pro Tyr Giy lie Thr Ser 
50 55 60 

Pro Gly lie Phe Ala Ser Ser Gin Pra Giy Gin Gly Asn lis Gin Mec 
S5 * 70 75 30 

He Asn Pre Ser Val Giy Thr Ala Val Mec Asn Phe Lys Glu Glu Ala 

85 90 95 

Lys Ala Leu Gly Val He Gin He Mec Val Gly Leu Met His lie Giy 
LOO 105 110 

Phe Glv lie Val Leu Cys Leu lie Ser Pee Ser Phe Arg Glu Val Leu 
115 120 125 

Giy Phe Ala Ser Thr Ala Xaa lie Gly Gly Tyr Pra Phe Trp Gly Gly 

130 135 140 

Leu Ser Phe lie He Ser Gly Ser Leu Ser Val Ser Ala Ser Lys Glu 
145 150 LS5 ISO 

Leu Ser Arg Cys Leu Val Lys Giy Ser Leu Gly MeC Asn He Gly Arg 
165 17 Q 175 

Ser He Leu Ala Phe He Giy Val He Leu Leu Leu Val Asp Met: Cys 
130 135 190 

He Asn Gly Val Xaa Gly Gin Asp Tyr Trp Xaa Val Lau Ser Gly Lys 
195 200 205 

Gly He Ser Ala Thr Leu Met He Phe Ser Xaa Lau Glu Phe Phe Val 
210 21S 220 

Ala Cys Ala Thr Ala Kis Phe Ala Asn Gin Ala Asn Thr Thr Thr Asn 
225 " 230 235 240 

Met Ser Val Leu Val He Pro Asn. Met Tyr Glu Ser Asn Pro Xaa Thr 
245 250 255 

Pro Ala Ser Ser Ser Ala Pro Pro Arg Cys Asn Asn Tyr Ser Ala Asn 

2S0 2S5 270 

Ala Pro Lvs Arg Lvs Arg Gly Ha Ser Leu Ha Ser Trp Arg Lys Thr 
275 " 230 235 

Thr Cys Lys Asn Phe Leu Arg Arg Cys Leu Leu Lau Ser Thr Mec He 
290 * 295 300 

Ser Ser Leu 

205 



<21Q> 143 
<211> 246 
<212> PH.T 

<213> Homo sapiens 



91 



<4GQ> 143 

Met Gly Arg leu Asp Gly Lys Val lie lie Leu Thr Ala Ala Ala Glr. 
15 10 15 

Gly lie Gly Gin Ala Ala Ala Leu Ala Phe Ala Arg Glu Gly Ala Lvs 
20 25 30 

Val lie Ala Thr Asp He Asn Glu Ser Lys Leu Gin Glu Leu Glu Lys 
33 40 45 

Tyr Pro Gly He Gin Thr Arg Val Leu Asp Val Thr Lys Lys Lys Gin 
50 55 SO 

lie Asp Glr- She Ala Asn Glu Val Glu Arg Leu Asp Val Leu Phe Asn 
S5 70 75 80 

Val Ala Gly Phe Val His His Gly Thr Val Leu Asp Cys Glu Glu Lys 
35 90 95 

Asp Trp Asp Phe Ser Met Asn Leu Asn Val Arg Asn Val Met Tyr Leu 

100 105 110 

Met lie Lys Ala Phe Lau Pro Lys Met Leu Ala Gin Lvs Ser Glv Asn 
115 12Q 125 

Ila He Asn Met Ser Ser Val Ala Ser Ser Val Lys Gly Val Val Asn 
13 0 13 5 140 

Arg Cys Val Tyr Ser Thr Thr Lys Ala Ala Val lie Gly Leu Thr Lys 

145 150 1S5 ISO 

Ser Val Ala Ala Asp Phe He Gin Gin Gly He Arg Cys Asn Cys Val 
1S5 170 175 

Cys Pro Gly Thr Val Asp Thr Pro Ser Leu Gin Glu Arg He Gin Ala 
130 185 ~ 190 

Arg Gly Asn Pre Glu Glu Ala Arg Asn Asp Phe Leu Lys Arg Gin Lys 
195 200 2Q5 

Thr Gly Arg Phe Ala Thr Ala Glu Glu He Ala Met Leu Cys Val Tyr 
210 215 220 

Lau Ala Ser Asp Glu Ser Ala Tyr Val Thr Gly Asn Pro Val He He 
225 230 235 240 

Asp Gly Gly Trp Ser Leu 
245 



<21Q> 144 
<2I1> 234 
<212> PRT 
<213> Homo sapiens 



<4QQ> 144 

Gly Thr He Gly Leu Tyr Trp Val Gly Ser Ha He Mec Ser Val Val 
15 10 15 



92 



Val Phe Val Pro Gly Asn lie Val Gly Lys Tyr Gly Thr Arg lie Cys 

20 25 30 

Pro Ala Phe Phe Leu Ser lie Pro Tyr Thr Cys Leu Pro Val Trp Ala 
35 40 ' 45 

Gly Phe Arg lie Tyr Asn Gin Pro Ser Glu Asn Tvr Asn Tvr Pro Ser 

50 55 SO 

Lys Val lie Gin Glu Ala Gin Ala Lys Asp Leu Leu Arg Arg Pro Fhe 
S5 70 75 ' " 30 

Asp Leu Met Leu Val Val Cys Leu Leu Leu Ala Thr Gly Phe Cys Leu 
a5 90 " 95 

Fhe Arg Gly Lau lie Ala Leu Asp Cys Pro Ser Glu Leu Cys Arg Lau 
10Q 105 110 

Tyr Thr Gin Phe Gin Glu Pro Tyr Leu Lys Asp Pro Ala Ala Tyr Pro 
115 120 125 

Lys lie Gin Met Leu Ala Tyr Met Fhe Tyr Ser Val Pro Tyr Phe Val 
130 135 L4Q 

Thr Ala Leu Tyr Gly Leu Val Val Pro Gly Cys Ser Trp Met Pro Asp 
145 ISO 155 ISO 

lie Thr Lau lie His Ala Gly Gly Lau Ala Gin Ala Gin Phe Sar Kis 
ISS 170 175 

lie Gly Ala Sar Leu His Ala Arg Thr Ala Tyr Val Tyr Arg Val Pro 
130 135 190 

Glu Glu Ala Lys Ha Leu Phe Leu Ala Leu Asn lie Ala Tyr Gly Val 
195 2QQ 205 

Leu Pro Gin Lau Leu Ala Tyr Arg Cys lie Tyr Lys Pro Glu Phe Phe 
210 21S 220 

lie Lys Thr Lys Ala Glu Glu Lys Val Glu 
225 230 



<21G> 145 

<211> 233 

<212> P5.T 

<213> Homo sapiens 

<220> 

<221> SITZ 

<222> (134) 

<223> Zaa equals any of the naturally occurring L-amino acids 

<400> 145 

Met Ser Asn Kis Asp Pro Arg Gly Cys Thr Arg Arg Arg Ala Gin Lys 
1 5 10 * IS 



Pro Lau Ala lis Gin Pro Arg Leu Phe His Ala Ser Ala Pro Asp Glu 
20 2S : 3 0 



93 



Gly Thr Gin Gly Thr Lau Lys Gly Thr Gin Lys Gly Gly Cys He Leu 
3 5 40 45 

Val Glr. Cys Gin Ser Glu Gly Gly Ala Ala Gly Ala Trp Thr Gly Pro 
SO 55 SO 

Pro Ser Pro Ala Arg Asp Arg Arg Val Arg Pro Pro Gly Thr lys Ala 
65 70 75 30 

Gin Arg Leu Glu Arg Arg Arg His Val Pro Arg lau His Gly Lau Gly 
35 90 95 

Val Gly Gly Cys Glu Val Arg Thr Gly Ha Val Ala Arg lie Ser Gly 
100 105 110 

Ser Thr Pro Tra Ala Gly Gly Lys Pro Leu Gly Lau His Gly Ala Met 
115 120 125 

Gly Glu Ala Gly Ala Gly Asp Thr Gly Cys Cys Ala Lys Gly Pro Ser 

L3Q 135 140. 

Pro Ala Ala Pro Leu Pro Ala Glu Gly Arg Gly Gin Gly Ala Gly Pro 
145 ISO 155 ISO 

Gly Gly Leu Val Gly Arg Gly Glu Arg Arg Asp Gin Gin Thr Leu Leu 
155 170 175 

Gly Wet Ala Glu Asp Thr Gly ICaa Ser Pro Ser Arg Pro Ser Ala Pro 
ISO 135 190 

Ala Pro Arg Ala Pro Val Pro Ala Arg Gin Pro Leu Pro Arg Ala Arg 
195 200 205 

Leu Gly Ala Ala Thr Ala He Ser Lys Ser Arg Ser Ser Arg Val Ala 
210 215 220 

Pro Ala Leu Ala Ala Ala He Ser Ala Ser Ser His Gin Arg 
225 230 235 



<210> 146 
<2L1> 207 
<212> PUT 
<2I3> Homo sapiens 

<220> 

<221> SITS 
<222> (3) 

<223> Xaa equals any of che naturally occurring L-amino acids 
<22Q> 

<22i> SITE 
<222> (5) 

<223> Xaa equals any of ohe naturally occurring L-amino acids 
<220> 

<221> SITS 
<222> (9) 
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<223> Zaa equals any of the nacuraliy occurring L-amino acids 

<22Q> 
<221> SITS 
<222> (20) 

<223> Zaa equals any of the nacurally occurring L-amino acids 

<220> 

<221> SITE 
<222> (169) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<40Q> 14S 

Ser Thr Xaa Thr Xaa Thr lie Gly Xaa Ala Gly Thr Pro Ala Gly Thr 
15 10 15 

Gly Pro Giu Phe Pro Gly Arg Pro Thr Arg Pro Gly Glu Xaa Pro Val 

20 25 30 



Asp Phe Ser Lys Gin Tyr Ser Ala 

35 40 

Ala Ala Leu Ala Cys Ser Ala Gly 

SO 55 

Pro -Val Leu Ser Lys Ser Ser Pro 

65 70 



Pro lie Leu As? Asn Asn Ala Val 

130 



Ser Trp Met Cys Leu Ser Leu Leu 
45 

Asp Thr Trp Ala Ser Giu Val Gly 
60 

Arg Leu lie Thr Thr Trp Glu Lys 

75 30 

Gly Leu Val Ser 
95 

Phe Leu Thr Gin 

110 

Gin Trp Pro Xle 
125 

lie Val -Asp Ser 



Glu Ser Thr Gly 
ISO 

lie Ala Gly Lys 
175 

Asn Leu Phe Ser Ser Val Leu lie 

135 190 



Val Pro Val Gly Thr Asn 

as 

Ser Leu Leu Gly Gly Thr 

100 

Leu lie Phe Val Asn Asp 
115 

lie Ala Phe Gly Gly Leu 
13 0 

Tyr Leu Gly Ala Thr Wet 
145 150 

Met Val Val Asn Ser Pro 
1S5 



Gly Gly Val Thr Val Val 

90 

Phe Val Gly lie Ala Tyr 

105 

Leu Asp lie Ser Ala Pro 
12 0 

Ala Gly Leu Leu Gly Ser 
135 140 

Gin Tyr Thr Gly Leu Asp 
155 

Thr Asn Xaa Ala Arg Eis 

170 



Ala Leu Leu Leu Pro Thr Ala Ala Trp Gly Phe Trp Pro Arg Gly 
195 200 205 



<21G> 147 
<211> 116 
<212> PRT 

<213> Homo sapiens 



<400> 147 
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Mec Ser Gin Arc Ala Gly Arg Arg Pro Giy Gly Trp Asa Fro Ser leu 
15 10 15 

Ser Val Vai Glu Val Cys Arg Gly Cys Arg Gly Thr Gly Pro Lau Pro 
20 25 30 

Trp Gly Ala Ser Leu Phe Pro Cys Ser Ala Ser Fro Leu Phe Pro Leu 

35 40 45 

Pro Leu Asn Arg Arg Gly Asp Val His Gly Thr Lau Gly Gly Arg Mec 
50 55 SO 

Lau Asn Arg Vai Glu Cys Arg Asp Giy Val Ala Ala Ala Trp Lau Cys 
S5 70 75 • 30 

Lau His Asp Ala Ala Ala lie Arg Giy Ala Val Giy Arg Cys Fro Mec 
35 90 95 

Trp ■ Thr Gin Pro Thr His Trp Val Lau Leu Leu Cys Trp Ala Leu His 
LOO 10 5 110 

Phe Tyr Cys Arg 
115 



<210> 143 
<21i> 81 
<212> P3.T 

<213> Home sapiens 
<40Q> 143 

Met Thr Ala His Ser Phe Ala Lau Pro Vai lie lie Phe Thr Thr Phe 
IS 10 15 

T=n GLv Leu Val Giy lie Ala Gly Pro Trp Phe Val Pro Lys Gly Pro 
20 " 25 30 

Asn Arg Gly Vai lie lie Thr Met Leu Val Ala Thr Ala Val Cys Cys 
35 40 45 

Tyr Lau Phe Trp Lau lie Ala lie Lau Ala Gin Lau Asn Pro Leu Phe 
50 ' 55 SO 

Giy Pro Gin Lau Lys Asn Glu Thr lie Trp Tyr Val Arg Phe Leu Trp 
55 " 70 75 30 

Glu 



<21Q> 149 
<211> 110 
<212> PET 
<213> Homo sapiens 

<4Q0> 149 

Ala Gin Arg Ala Ala Arg Leu Gly Thr Arg Ala Pro Ala Ala Fro Ala 
1 ' 5 10 15 
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Ala Arg Pro Cys lis Leu Pro Gly His Pro Ala Pro Gly His Asp Gly 

20 25 30 

Ala Leu lie Arg Pro Pro Gly His His Lau His His Val Lau Gly Pro 
35 40 45 

Arg Arg His Arg Gly Pro Trp Ptie Val Pro Lys Gly Pro Asn Arg Gly 
50 55 SO 

Val lie lie Thr Met Leu Val Ala Thr Ala Val Cys Cys Tyr Lau Phe 

S3 70 75 80 

Trp Leu He Ala He Leu Ala Gin Leu Asr. Pro Leu Phe Gly Pro Gin 

35 SO 95 

Lau Lys Asn Giu Thr lie Trp Tyr Val Arg Phe Leu Trp Glu 

100 105 110 



<210> 150 
<211> 135 
<212> P3.T 
<213> Homo sapiens 

<40Q> 150 

Mac Thr Lau Glu Glu His Arg Asp Arg Pro Arg Leu Gly Met Cys Met 
IS 10 15 

Cys Val Cys Ala Cys Val Tyr Ala Cys Met Leu Met His Val Cys Val 
20 25 30 

His Ala Cys Leu Cys Val Cys Val Cys Val Cys Val Giu Pro Trp Ser 
35 40 45 

Ser Arg Gin Ser Lys Asp Thr Gly Gly Trp His Mec Glu Glu Gin Val 
50 " 55 SO 

Thr Pro Pro Ser Lau Ala Gin Lau Lys Ser Gly Gin Val Arg Gly Glu 
S3 70 75 30 

Met Gly Glu Gly Arg Gly Glu Lys Gly Giu Glu Ala Leu Thr Gly Gly 
35 90 95 

Ala Glu Ala Leu Ser Leu Lau Gly Arg Arg Ser Pro Ser Thr Pro Lau 
100 1QS HQ 

Phe Lau Asp Arg Glu Asp Lys Gin Ala Lys Asp Ala Arg Asn Leu Ser 
115 12 0 ' 123 

Ser Thr Val Ala Pro Asp Phe 
13 0 13 5 



<210> 151 

<21L> 32 

<212> PRT 

<213> Homo sapiens 



<4QQ> 151 
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His Glu Lys lie Lau Thr Pro He Trp Pro Ser Ser Thr Asp Leu Glu 
15 10 15 

Lys Pro His Glu Mec Lau. Phe Lau Asn Val lis Lau Phe Ser Lau Thr 
20 25 30 

Val Phe Thr Lau lie Ser Thr Ala Hi3 Thr Leu Asp Arg Ala Val Arg 
35 40 45 

Ser Asp Trp Lau Leu Lau Val Leu lie Tyr Ala Cys Leu Glu Glu Lau 
50 ' 55 SO 

He Pro Glu Lau He Phe Asn Leu Tyr Cys Gin Gly Asn Ala Thr Leu 
S5 70 75 80 

Phe Phe 



<210> 152 

<211> 71 

<212> PRT 

<213> Homo sapiens 

<400> 152 

Pro Ala Asn Lys Ala Gly Ala Ala He Glu Ala Gly He Gly He Ser 
1 S 10 IS 

Leu Met Val Leu Ser Pro Trp Ala Cys Leu Phe Val Val Phe Phe Pro 
20 25 30 

Tyr He Gin Ser Ser Lau Arg Ser Asp Lys Kis Leu Gin Leu Ser Asn 
35 40 45 

He Leu Pro Thr Pro Ser Kis His Ha His Leu Pro Ala Ser Ha Cys 
50 55 SO 

He Gin Leu Arg Ala Gly Asn 
65 70 



<2IQ> 153 

<211> 75 

<212> PRT 

<213> Homo sapiens 

<4QQ> 153 

Ala Glv Ser Pro Ala Gly Thr Gly Pro Glu Phe Pro Gly Arg Pro Thr 
1 5 10 IS 

Arg Pro He Ser Thr His Val Phe Glu Tyr Glu Cys He Cys Lys Ha 
20 25 30 

Pro Arg Phe Mec Cys Glu Tyr Val Leu Lau Leu Tyr He Val Leu Leu 
35 40 45 



Cys Asn Arg Ser Tyr Ala Val Phe Thr Gin Cys Val Lau Arg Ser Ser 
50 S3 SO 
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Pro II a Asp Ser Ser Arg Asn Ala Val Leu Lau 
63 70 75 



<21G> 154 

<211> 483 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (194) 

<223> Xaa equals any of die naturally occurring L-amino acids 
<220> 

<22i> SITE 

<222> (205) 

<223> Xaa equals any .of the nacuraiiy occurring L-amino acids 

<4QC> 154 

Met Pro Ser Giy Mec Ser Ala Ala Val Pro lie Ser Gly Leu Leu Asn 
1 5 10 15 

Leu Sar His Asn Ser lie Ser Gin Glu Ser Ala Lau Tyr Leu Leu Giu 
20 25 ' 30 

Thr Lau Pro Ser Cys Pro Arg Val Arg Giu Ala Ser Val Asn Leu Gly 
35 40 45 

Ser Giu Gin Ser Phe Arg lie His Fhe Ser Arg Glu Asp Gin Ala Giy 
50 55 SQ 

Lys Thr Leu Arg Leu Ser Giu Cys Ser Phe Arg Pro Giu His Val Ser 
65 70 75 30 

Arg Leu Ala Thr Giy Leu Ser Lys Ser Lau Gin Leu Thr Glu Lau Thr 
85 90 95 

Leu Thr Gin Cys Cys Leu Giy Gin Lys Gin Lau Ala lie Leu Leu Sar 

iaa ias no 

Leu Val Gly Arg Pro Ala Gly Leu Phe Ser Leu Arg Val Gin Giu Pro 
115 120 ' 125 

Trp Ala Asp Arg Ala Arg Val Leu Ser Leu Leu Giu Val Cys Ala Gin 
130 135 140 

Ala Sar Gly Ser Val Thr Glu lie Ser lie Ser Giu Thr Gin Gin Gin 
145 150 155 ISC 

Leu Cys Val Gin Leu Giu Phe Pro Arg Gin Glu Giu Asn Pro Glu Ala 
155 170 175 

Val Ala Lau Arg Lau Ala His Cys Asp Leu Gly Ala His His Ser Leu 
130 135 13Q 



Leu Xaa Gly Gin Leu Men Giu Thr Cys Ala Arg Leu Xaa Gin Leu Ser 
195 200 ' 2Q5 
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Lau Ser Gin Val Asn Leu Cys Glu Asp Asp Asp Ala Ser Ser Lau Lau 
210 215 ' 220 

Lau Gin Sar Lau Lau Lau Ser Lau Ser Glu Lau Lys Thr Phe Arg Lau 
225 23Q 235 " 240 

Thr Ser Ser Cys Val Ser Thr Glu Gly Lau Ala His Leu Ala Ser Gly 
245 250 255 

Lau Gly His Cys His His Leu Glu Glu Leu Asp Lau Ser Asn Asa Gin 
260 2S5 270 

Phe Asp Glu Glu Gly Tiir Lys Ala Lau Met Arg Ala Leu Glu Gly Lvs 
275 230 " 235 

Trp Mec Leu Lys Arg Lau Asp Lau Ser His Leu Leu Lau Asn Ser Ser 
290 235 300 

Thr Lau Ala Lau Leu Thr His Arg Leu Ser Gin Mec Thr Cys Lau Gin 

3 05 3L0 315 320 

Ser Lau Arg Lau Asn Arg Asn Ser lis Gly Asp Val Gly Cys Cys His 
3 25 33 0 '33 5 

Leu Ser Glu Ala Leu Arg Ala Ala Thr Ser Leu Glu Glu Lau Asp Leu 
240 345 350 

Ser His Asn Gin lie Gly Asp Ala Gly Val Gin His Leu Ala Thr lie 
355 360 365 

Leu Pro Gly Lau Pro Glu Lau Arg Lys Lie Asp Leu Ser Glv Asn Ser 
370 375 330 

lis Sar Ser Ala Gly Gly Val Gin Leu Ala Glu Ser Leu Val Lau Cys 
335 390 395 400 

Arg Arg Leu Glu Glu Leu Mec Leu Gly Cys Asn Ala Leu Gly Asp Pro 
405 410 ' 415 

Thr A-la Leu Gly Leu Ala Gin Glu Lau Pro Gin His Leu Arg Val Leu 
420 425 430 

His Leu Pro Phe Ser His Leu Gly Pro Gly Gly Ala Lau Ser Leu Ala 
435 440 445 

Arg Pro Trp Mec Asp Pra Pro lie Trp Lys Arg Ser Ala Trp Arg Lys 
450 455 460 

Thr Thr Trp Leu Glu Gly Ser Cys Val Ser Val Trp Ser Ser Arg Cys 
465 47Q ' 475 430 

Ser Asp Arg 



<210> 155 
<211> 221 
<212> FRT 
<213> Homo sapians 



100 



<40Q> 155 

Kis Gin Leu Ser Arg Gly Ser Ala Val Gly Arg Val Ser Arg Ser Leu 
IS LO L5 

Gin Ala Pro Gly Gly Val Asp Ala Trp Lau Gin Cys Pro Gly Gly Sar 

2Q 25 30 

His Sar Pro Gly Ala Gly Sar Gly Ala Ala Pro Ala Pro Giu Gly Pro 
35 40 45 

Thr Pro Thr lis Gin Pro Sar Gly Pro Arg Trp Gly Pro Giu Pro Gly 
50 55 60 

Gin Ala Lau Asp Gly Sar Pro His Lau Giu Giu lie Sar Lau Ala Giu 
55 ' 70 75 30 

Asn Asn Lau Ala Gly Gly Val Lau Arg Phe Cys Met: Giu Lau Pro Lau 

85 90 ' 35 

Lau Arg Gin He Asp Lau Val Sar Cys Lys Ha Asp Asn Gin Thr Ala 
100 105 110 

Lys Leu Lau Thr Ser Sar Phe Thr Sar Cys Pro Ala Lau Giu Val lie 
115 120 125 

Leu Leu Sar Trp Asn Lau Leu Gly Asp Giu Ala Ala Ala Giu Lau Ala 
130 135 140 

Gin Val Lau Pro Gin Mac Gly Arg Lau Lys Arg Val Asp Lau Giu Lys 
145 150 155 ISO 

Asn Gin lie Thr Ala Lau Gly Ala Trp Lau Lau Ala Giu Gly Lau Ala 
155 170 175 

Gin Gly Sar Ser He Gin Val lie Arg Leu Trp Asn Asn Pro He Pro 
ISO iaS 190 

Cys Asp Mac Ala Gin Kis Lau Lys Ser Gin Giu Pro Arg Lau Asp Phe 
195 200 205 

Ala Phe Phe Asp Asn Gin Pro Gin Ala Pro Trp Gly Thr 
210 215 220 



<210> 155 
<211> 39 
<212> PS.T 

<213> Homo sapiens 
<4QC> 155 

Giu Lys Lau Phe Cys Phe Giu Mat Lau Lau He Cys Lys Phe Ser Pro 
1 "S 10 15 

Asn Ser Val Pro Pro Giu Thr Cys Ala He Lau Asn Gin Gly Leu Met 
20 25 3Q 



Asp Leu Gly Lau Cys Arg MeC Cys Lau Gly Asn Asn Mec Phe Ala Gly 

35 40 - 45 
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Ser Met Leu Gly Lys Ser His Arg His Ser Pro Phe Ser lie Asn Gin 

50 55 SO 

Arg His Asn Ala Leu Arg Lys Ala Ala Gly Thr Fro Ala Gin Lys Ser 

S5 70 75 30 

Lau Gly lie Val Gin Val Ser Pro Asa 
85 



<210> 157 

<211> 53 

<212> PRT 

<213> Homo sapiens 

<4Q0> 157 

Gly Cys Ala Gly Cys Ala Leu. Val Thr lie Cvs Lau Gin Ala Val Cys 
1 '5 10 15 

Lau Val Lys Ala lie Ala lie Leu Kis Ser Arg Leu Thr Arg As-o Thr 
2G 25 30 

Met His Cys Gly Arg Pro Gin Gly Pro Leu Pro Arg Lvs Ala Trn Val 
3 5 40 ~ 45 

Leu Ser Arg Phe Pro Pro Thr Glu Thr Ala 
50 55 



<210> 153 

<211> 43 

<212> PRT 

<212> Homo sapiens 

<4QQ> 153 

Pro Glu Thr Gin Cys Thr Ala Glu Gly Arg Arg Asra Pro Cys Pro Glu 
1 5 10 IS 

Lys Pro Gly Tyr Cys' Pro Gly Phe Pro Gin Leu Arg Gin Pro Glu lie 
.'20 25 " 30 

Trp Pro Arg Gly Lys Gly Lys Thr Leu His Pro Pro Ala Arg Kis Me: 
3S 40 45 



<21Q> 159 

<2ii> a 

<212> PRT 

<213> Homo sapiens 

<400> 159 

Ser Glu lie Gly Glu Asn Arg Pro 
1 5 
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<21Q> ISO 

<2XI> 8 

<212> PRT 

<213> Homo sapiens 

<4QG> ISO 

His Asp Thr Asp Ser Phe Ala His 

1 5 



<21Q> 161 

<2il> 7 

<212> FRT 

<213> Homo sapiens 

<400> 1S1 

Ala Leu Arc Lys Ala Ala Gly 
1 5 



<210> 1S2 
<2il> 143 
<212> PRT 
<213> Homo sapiens 

<4Q0> 1S2 

Met Arg Gly Pro Val Cys Gly Phe Ser Leu Val Giu Mec Leu Leu Ala 
1 " * S 10 15 

Leu Ala Leu Gly Leu Mec Leu lie Leu Gly Val Thr Gin lie Ala Leu 
20 25 30 

Ser Ser Arg Thr Thr Tyr Ala Ser Gin Ser Ala Ala Ser Leu Leu Gin 
35 40 45 

Asp Asp Ala Arg ?he Ala Leu Gly Lys Leu He Gin Glu lie Arg Gin 
SO " 55 SO 

Ala Gly Met Pha Gly Cys Leu Ser Ala Ala Sar lie Sar Asn Ala Pro 
S5 * " 70 75 30 

Ala Gly Phe Asp Arg Pro lie Gly Trp Ser Thr Thr Gly Sar Ser Arg 
35 90 95 

Ser Leu Thr Lau Val Thr. Ala Asp Val Gly Glu Gly Gly Ser Lys Pro 
100 105 110 

Asp Trp Thr Val Leu Ser Asp Cys Thr Gly Ser Ala His Ala Tyr Val 
115 120 125 

Gly Ser Pro Pra Ala Ala Asn Ala Arg Ala Asn Pro Leu Pro Thr Cys 
13 0 13 5 140 

Ala Lys Lau Thr 

145 



<210> 
<211> 



153 
137 
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<212> PRT 

<213> Homo sapiens 
<4QQ> 153 

Mec Gly Tyr Tyr Leu Ser Arg Ser Arg Gin Ala Gly Met Val Lau Leu 
15 10 15 

lie Ser Lau Val Phe Lau Lau Lau Lau Ala Leu Lau Gly Val Ser Ser 
20 25 30 

Met: Gin Gly Ala Lie Ser Gin Glu Lys lie Thr Gly Ser Lau Arg Gin 
35 40 45 

Arg Asn Gin Ser Phe Gin. Gin Ala Glu Ser Gly Leu Arg Lau Gly Glu 
50 55 SO 

Ser Leu Val Gin Ala Ser Gly Ptie Ala Leu Arg Pro Cys His Ser Thr 

S5 70 75 8 0 

Ala Ala Cys Ala Pro Pro Ala Glu Sar Val Ser Val Val Gly Pro Gly 
35 90 35 

Thr Asn Pro Val Ser Thr Val Thr Trp lie Gly Met Lys Asp Gly Val 
100 1Q5 110 

Tyr Gly lie Gin Asn Leu Gly Pro Gly Thr Gly Lau Val Asn Sar Arg 
115 120 125 

Gin Arg Pro Arg Pro Arg Ser lie Ala 
13 Q 13 5 



<210> 1S4 
<211> 2Q9 
<212> PRT 

<213> Homo sapiens 
<40Q> 1S4 

Glu Asn Glu Ser Thr Lys Glu Pro Ser Leu Lau Gin Tyr Leu Cys Val 
15 10 15 

Gin Ser Pro Ala Gly Lau Asn Gly Phe Asn Val Leu Leu Ser Gly Ser 

20 25 30 

Gin Thr Pro Pro Thr Val Gly Pro Ser Sar Gly Gin Lau Pro Ser Phe 
35 40 45 

Sar Val Pro Cys Mec Val Leu Pro Sar Pro Pro Leu Gly Pro Phe Pro 
50 55 SO 

Val Leu Tyr Ser Pro Ala Mec Pro Gly Pro Val Ser Ser Thr Leu Gly 
S5 ' 70 75 30 

Ala Leu Pro Asn Thr Gly Pro Val Asn Phe Ser Leu Pro Gly Leu Gly 
35 90 95 

Ser lie Ala Gin Leu Leu Val Gly Pro Thr Ala Val Val Asn Pro Lys 
L0Q 105 110 
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Ser Ser Thr Lau Pro Ser Ala Asp Pro Gin Leu Gin Ser Gin Pro Ser 
115 120 125 

Leu Asn Leu Ser Pro Val MeC Ser Arg Ser His Ser Val Val Gin Gin 
130 125 140 

Pro Glu Ser Pro Val Tyr Val Gly His Pro Val Ser Val Val Lys Leu 
145 150 155 ISO 

His Gin Ser Pro Val Pro Val Thr Pro Lys Ser lie Gin Arg Thr His 
155 170 175 

Arg Glu Thr Phe Phe Lys Thr Pro Gly Ser Lau Gly Asp Pro Val Leu 
ISO 135 190 

Lys Arg Arg Glu Arg Asn Asn His Glu Thr Pro Ala Arg Pro Arg Gly 

195 200 205 

Asp 



<2I0> 155 
<21i> 454 
<212> PRT 
<213> Home sapisns 

<4G0> 1S5 

Arg His Glu Arg His Glu Tyr Arg Arg Ala Leu Asp His Glu Glu Glu 
15 10 15 

Ala Leu Sar Sar Gly Ser Val Gin Glu Ala Glu Ala Mec Leu Asp Glu 
20 25 30 

Pro Gin Glu Gin Ala Glu Gly Ser Leu Thr Val Tyr Val lie Sar Glu 
35 40 45 

His Sar Ser Leu Leu Pro Gin Asp Mec Mec Ser Tyr He Gly Pro Lys 
SO 55 SO 

Arg Thr Ala Val Val Arg Gly He Mac Kis Arg Glu Ala Phe Asn lie 
65 70 75 SO 

He Gly Arg Arg He Val Gin Val Ala Gin Ala MeC Ser Lau Thr Glu 
35 90 95 

Asp Val Leu Ala Ala .Ala Lau Ala Asp His Leu Pro Glu Asp Lys Trp 
100 1Q5 110 

Ser Ala Glu Lys Arg Arg Pro Lau Lys Ser Ser Leu Gly Tyr Glu lie 

115 120 125 

Thr Phe Ser Leu Leu Asn Pro Asp Pro Lys Ser His Asp Val Tyr Trp 
130 135 140 

Asp lie Glu Gly Ala Val Arg Arg Tyr Val Gin Pro Phe Leu Asn Ala 
145 150 155 ISO 

Leu Gly Ala Ala Gly Asn Phe Ser Val Asp Sar Gin lie Leu Tyr Tyr 
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165 170 175 

Ala Met Isu Gly Val Asn Pro Arg Phe Asp Ser Ala Ser Ser Ser Tyr 
130 13 5 ISO 

Tyr Leu Asp Met His Ser Leu Pro His Val lie Asn Pro Val Glu Ser 
195 200 205 

Arg Leu Gly Ser Ser Ala Ala Ser Leu Tyr Pro Val Leu Asn Phe Leu 
210 " 215 220 

Leu Tyr Val Pro Glu Leu Ala His Ser Pro Leu Tyr lie Gin Asp Lys 
225 230 235 240 

Asp Gly Ala Pro Val Ala Thr Asn Ala Phe His Ser Pro Arg Trp Gly 
245 250 255 

Gly lie w.e: Val Tyr Asn Val Asp Ser Lys Thr Tyr Asn Ala Ser Val 
260 285 270 

Leu Pro Val Arg Val Glu Val Asp Met Val Arg Val Met Glu Val Phe ' 

275 280 285 

Leu Ala Gin Leu Arg Leu Leu She Gly He Ala Gin Pro Gin Leu Pro 
290 295 300 

Pro Lys Cys Leu Leu Ser Gly Fro Thr Ser Glu Gly Leu Met Thr Trp 
305 310 315 320 

Glu Leu Asp Arg Leu Leu Trp Ala Arg Ser Val Glu Asn Leu Ala Thr 
325 330 235 

Ala Thr Thr Thr Leu Thr Ser Leu Ala Gin Leu Leu Gly Lys lie Ser 
34Q 345 350 

Asn He Val lie Lys Asp Asp Val Ala Ser Glu Val Tyr Lys Ala Val 
355 360 365 

Ala Ala Val Gin Lys Ser Ala Glu Glu Leu Ala Ser Gly His Leu Ala 
370 * 375 330 

Ser Ala Phe Val Ala Ser Gin Glu Ala Val Thr Ser Ser Giu Leu Ala 

335 390 395 400 

Phe Phe Asp Pro Ser Leu Leu His Leu Leu Tyr Phe Pro Asp Asp Gin 
4Q5 410 415 

Lys Phe Ala lie Tyr 11a Pro Leu Phe Leu Pro Met Ala Val Pro He 
42Q 425 430 

Leu Leu Ser Leu Val Lys lie Phe Leu Glu Thr Arg Lys Ser Trp Arg 
435 440 445 

Lys Pro Glu Lys Thr Asp 
450 



<210> 166 
<211> 65 
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<212> PRT 

<2I3> Homo sapiens 
<4QQ> 165 

Lys Lau Leu Leu Thr Lys Val GIu Gin Lys Leu Glu Leu Ala Arg Leu 
1 5 10 IS 

Gin Val Asp Thr Ser Gly Ser Lys Glu Phe Gly Thr Ser Gly lie Pro 
20 25 " 3 0 

Ala Lys Cys Arg Phe Pro Lys I la Fhe Val Asn Thr Asp Asp Thr Tyr 
3 5 40 45 

Glu Glu Lau His Lau lie Val Tyr Lys Val Thr Thr Val Phe Leu Pro 
50 55 50 

Ala Lau 
SS 



<21Q> 157 
<211> 79 
<212> PUT 

<213> Homo sapiens 
<4Q0> 157 

Kec Glu Pro Gin Leu Gly Pro Glu Ala Ala Ala Leu Arg Pro Gly Trp 
15 10 15 

Leu Ala Lau Lau Leu Trp Val Ser Ala Leu Ser Cys Ser Phe Ser Leu 
20 25 3 0 

Pro Ala Ser Ser Leu Ser Ser Leu Val Pro Gin Val Arg Thr Ser Tyr 

25 40 45 

Asn Phe Gly Arg Thr Phe Leu Gly Leu Asp Lys Cys Asn Ala Cys He 
50 55 SO 

Gly Thr Ser He Cys Lys Lys Phe Phe Lys Glu Arg Asn Lys lie 
65 70 75 



<210> 153 
<211> 209 
<212> PH.T 
<213> Kcmo sapiens 

<4Q0> 153 

Gin Leu Pro Leu Trp Pro Ser Pro Ala Ser Val Gin Fro Arg Val Asp 
1 5 10 15 

Ser Gin Arg Ala Arg Gly Ser Pro Glu Fro Lys Mec Glu Pro Gin Leu 
20 25 20 

Gly Pro Glu Ala Ala Ala Leu Arg Pro Gly Trp Lau Ala Leu Leu Leu 
35 40 45 



Trp Val Ser Ala Leu Ser Cys Ser Phe Ser Leu Fro Ala Ser Ser Leu 
50 55 SO 
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Ser Ser Leu Vai Pro Gin Val Arg Thr Ser Tyr Asn Phe Giy Arg Thr 
55 70 75 ' SO 

Phe Leu Gly Leu Asp Lys Cys Asn Ala Cys lie Gly Thr Ser lie Cys 
35 90 95 

Lys Lys Pile Phe Lys Glu Glu lis Arg Ser Asp Asn Trp Leu Aia Ser 

10Q 105 110 

His Lau Gly Lau Pro Fro Asp Ser Leu Leu Ser 'Tyr Pro Ala Asn Tyr 
115 120 12 5 

Ser Asp Asp Ser Lys lie Trp Arg Pro Val Glu lie Phe Arg Leu Val 
12 0 13 5 140 

Ser Lys 'Tyr Gin Asn Glu lie Sar Asp Arg Lys lie Cys Aia Ser Aia 
145 ' 150 1S5 ISO 

Ser Ala Pro Lys Thr Cys Ser lie Glu Arg Val Lau Arg Lys Thr Glu 

1S5 170 175 " 

Arg Phe Gin Lys Trp Leu Gin Aia Lys Arg Leu Thr Pro Asp Leu Val 
130 185 190 

Gin Asp Cys His Gin Gly Gin Arg Glu Leu Lys Phe Lau Cys Met: Leu 
195 200 205 

Arg 



<210> 169 
<211> 145 
<212> PRT 

<213> Homo sapiens 
<220> 

<22I> SITS 
<222> (95) 

<223> ICaa equals any of Che nacuraiiy occurring L- amino acids 
<220> 

<22i> SITE 
<222> (107) 

<223> Xaa equals any of the naturally occurring L -amino acids 
<220> 

<221> SITS 
<222> (111) 

<223> Saa equals any of the nacuraiiy occurring L-amino acids 
<220> 

<22i> SITS 
<222> (115) 

.<223> 2aa equals any of che nacuraiiy occurring L-amino acids 
<22Q> 

<221> SITS 
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<222> (122) 

<223> Xaa equals any of the naturally occurring L- amine acids 
<22Q> 

<22i> SITE 
<222> (132) 

<223> Xaa equals any of the naturally occurring L- amino acids 
<40Q> 169 

Med Glu Pro Gin leu Giy Pro Giu Ala Ala Ala leu Arg Pro Gly Trp 
1 S 10 15 

Leu Ala Leu Leu Leu Trp Val Ser Ala Leu Ser Cys Ser She Ser Leu 
20 25 3 0 

Pro Ala Ser Ser Leu Ser Ser Leu Val Pro Gin Val Arg Thr Ser Tyr 

35 40 45 

Asia Phe Giy Arg Thr Phe Leu Giy Leu Asp Lys Cys Asn Ala Cys lie 
50 55 60 

Gly Thr Ser lie Cys Lys Lys Phe Phe Lys Glu Giu lie Arg Ser Asp 
65 70 75 30 

Asn Trp Leu Ala Ser His Leu Giy Thr Ala Ser Arg Phe Pro Leu Xaa 
35 90 95 

Ser Tyr Pro Cys Lys Leu Leu Gin Met lie Xaa Lys lie Trp Xaa Pro 
LQ0 105 ' 110 

Cys Gly Xaa Leu Leu Thr Giy Gin Gin Xaa Ser Asn Glu lie Sar Lys 
115 120 125 

Gin Glu lie Xaa Cys Leu Leu His Pro Pro Pro Lys Asn Leu His lie 
130 135 140 

Asp Val 
145 



<210> 170 

<211> 97 

<212> PH.T 

<213> Homo sapiens 

<4Q0> 170 

Giy Pro Arg Ala Arg Val Gin Gly Phe Ser Giy Ala Asp lie Val Lys 
15 10 IS 

Phe Mec Ala Leu Giy Ser Mec Tyr Leu Val Leu Thr Leu lie Val Ala 
20 25 30 

Lys Val Leu Arg Gly Ala Glu Pro Cys Cys Gly Pro Leu Lys Asn Arg 
35 40 45 

Val Leu Arg Pro Cys Pro Leu Pro Val His Cys Pro Leu Pro lie Pro 
50 55 60 

Ser Pro Ala Glu Giy He Pro Trp Val Ala Tyr Leu Pro lie Arg Trp 
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70 



75 



Ptie lis Ser Cys Cys Pro Gly His Cys lis Glzi Us Pro Met Cys Thr 
35 90 95 

Ser 
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